Table 1 

Statistics of the studied corpora.

Le-MondeGutenberg
Number of sentences 172,168 53,996 
Number of phonemes 35 57 
Corpus size (number of phones) 16,668,609 1,539,735 
Number of diphonemes 1,172 1,955 
Number of diphones 16,496,441 1,485,739 
Number of triphonemes 26,443 27,477 
Number of triphones 16,324,273 1,431,743 
Sentence length mean (phones) & Std. Dev. 96.81 (60.46) 28.51 (10.52) 
Le-MondeGutenberg
Number of sentences 172,168 53,996 
Number of phonemes 35 57 
Corpus size (number of phones) 16,668,609 1,539,735 
Number of diphonemes 1,172 1,955 
Number of diphones 16,496,441 1,485,739 
Number of triphonemes 26,443 27,477 
Number of triphones 16,324,273 1,431,743 
Sentence length mean (phones) & Std. Dev. 96.81 (60.46) 28.51 (10.52) 
Close Modal

or Create an Account

Close Modal
Close Modal