Skip to Main Content
Table 1: 

Dataset statistics for RotoWire and MLB. Vocabulary size, number of tokens, number of instances (i.e., table-summary pairs), number of record types, average number of records, average number of paragraph plans, and average summary length.

RotoWireMLB
Vocab Size 11.3K 38.9K 
# Tokens 1.5M 14.3M 
# Instances 4.9K 26.3K 
# Record Types 39 53 
Avg Records 628 565 
Avg Paragraph Plans 10.7 15.1 
Avg Length 337.1 542.05 
RotoWireMLB
Vocab Size 11.3K 38.9K 
# Tokens 1.5M 14.3M 
# Instances 4.9K 26.3K 
# Record Types 39 53 
Avg Records 628 565 
Avg Paragraph Plans 10.7 15.1 
Avg Length 337.1 542.05 
Close Modal

or Create an Account

Close Modal
Close Modal