Table 2: Sizes of vocabularies. EN-ori represents original English sentences without BPE.
Dataset   EN-ori   EN      AMR     DE
NC-v11    79.8K    8.4K    36.6K   8.3K
Full      874K     19.3K   403K    19.1K