Skip to Main Content
Table 1: 

Dataset statistics for INSteD. Numbers in parentheses are percentages of incoherent documents.

Source#docsavg.avg.
#sents#tokens
Wikipedia 106,352 (46%) 5±1 126±24 
CNN 72,670 (49%) 5±1 134±32 
Source#docsavg.avg.
#sents#tokens
Wikipedia 106,352 (46%) 5±1 126±24 
CNN 72,670 (49%) 5±1 134±32 
Close Modal

or Create an Account

Close Modal
Close Modal