Table 4: Test set results for models finetuned on the CronQuestions dataset in a closed-book manner. “None” refers to finetuning the T5 baseline; the “2018” model is adapted to the 2018 slice of CustomNews.

Size   Model     EM     F1
Small  None      3.63    9.51
       Uniform   4.01   10.27
       Temporal  4.05   10.20

Large  None      4.10   10.78
       2018      4.39   10.87
       Uniform   4.70   11.34
       Temporal  5.13   11.93

XXL    None      5.44   12.19
       Uniform   5.71   12.61
       Temporal  5.81   12.88