Table 1 

Europarl corpus size, in sentences and tokens.


         Original language   #Sentences   #Tokens
FR-EN    French              168,818      4,995,397
         English             134,318      3,441,120
DE-EN    German              200,037      5,571,202
         English             129,309      3,283,298
IT-EN    Italian             69,270       2,535,225
         English             125,640      3,389,736
