Statistics of the studied corpora.
. | Le-Monde . | Gutenberg . |
---|---|---|
Number of sentences | 172,168 | 53,996 |
Number of phonemes | 35 | 57 |
Corpus size (number of phones) | 16,668,609 | 1,539,735 |
Number of diphonemes | 1,172 | 1,955 |
Number of diphones | 16,496,441 | 1,485,739 |
Number of triphonemes | 26,443 | 27,477 |
Number of triphones | 16,324,273 | 1,431,743 |
Sentence length mean (phones) & Std. Dev. | 96.81 (60.46) | 28.51 (10.52) |
. | Le-Monde . | Gutenberg . |
---|---|---|
Number of sentences | 172,168 | 53,996 |
Number of phonemes | 35 | 57 |
Corpus size (number of phones) | 16,668,609 | 1,539,735 |
Number of diphonemes | 1,172 | 1,955 |
Number of diphones | 16,496,441 | 1,485,739 |
Number of triphonemes | 26,443 | 27,477 |
Number of triphones | 16,324,273 | 1,431,743 |
Sentence length mean (phones) & Std. Dev. | 96.81 (60.46) | 28.51 (10.52) |