Table 1. Statistics of the monolingual corpora.


                    English       French       Hebrew
Tokens          447,073,250  522,964,336   46,239,285
Types             2,421,181    2,416,269      188,572
Bigram tokens   429,550,149  505,441,224   45,858,152
Bigram types     22,929,768   21,428,007    5,698,581
