Table 1: 

Corpus-based measures of morphology defined for this study. These measures are calculated on tokenized data sets before applying any segmentation method.

MeasureDefinition
Types Number of unique word tokens 
TTR Number of unique word tokens divided by total 
 number of word tokens 
MATTR Average TTR calculated over a moving window 
 of 500 word tokens 
MLW Average number of characters per word token 
MeasureDefinition
Types Number of unique word tokens 
TTR Number of unique word tokens divided by total 
 number of word tokens 
MATTR Average TTR calculated over a moving window 
 of 500 word tokens 
MLW Average number of characters per word token 
Close Modal

or Create an Account

Close Modal
Close Modal