Table 4: StrategyQA statistics. Filtered questions were rejected by the solvers (§3.1). The train and test sets of question writers are disjoint. The “top trigram” is the most common trigram.

                                       Train    Test
# of questions                         2290     490
% “yes” questions                      46.8%    46.1%
# of unique terms                      1333     442
# of unique decomposition steps        6050     1347
# of unique evidence paragraphs        9251     2136
# of occurrences of the top trigram    31

# of question writers                  23
# of filtered questions                2821     484

Avg. question length (words)           9.6      9.8
Avg. decomposition length (steps)      2.93     2.92
Avg. # of paragraphs per question      2.33     2.29