Skip to Main Content
Table 8 

Predicted label breakdown for the crawled data, over the four varieties of Arabic. All varieties were given equal priors.

Variety
Sentence Count
Percentage
MSA 13,102,427 71.9% 
LEV 3,636,525 20.0% 
GLF 630,726 3.5% 
EGY 849,670 4.7% 
ALL 18,219,348 100.0% 
Variety
Sentence Count
Percentage
MSA 13,102,427 71.9% 
LEV 3,636,525 20.0% 
GLF 630,726 3.5% 
EGY 849,670 4.7% 
ALL 18,219,348 100.0% 
Close Modal

or Create an Account

Close Modal
Close Modal