Skip to Main Content
Table 4

Data set statistics for all tasks, languages, and splits. sampled: Ranges between 250 and 1,000. *: No original development set.

  #training#dev#test
SRL Czech #sampled 5,228 4,213 
Catalan #sampled 1,724 1,862 
Spanish #sampled 1,655 1,725 
Turkish #sampled 844 842 
Finnish #sampled 716 648 
  
POS & DEP Vietnamese 1,400 800 800 
Telugu 1,051 131 146 
Tamil 400 80 120 
Belarusian 319 65 253 
Kazakh* 23 1,047 
Kurmanji* 15 734 
Buryat* 14 908 
  #training#dev#test
SRL Czech #sampled 5,228 4,213 
Catalan #sampled 1,724 1,862 
Spanish #sampled 1,655 1,725 
Turkish #sampled 844 842 
Finnish #sampled 716 648 
  
POS & DEP Vietnamese 1,400 800 800 
Telugu 1,051 131 146 
Tamil 400 80 120 
Belarusian 319 65 253 
Kazakh* 23 1,047 
Kurmanji* 15 734 
Buryat* 14 908 
Close Modal

or Create an Account

Close Modal
Close Modal