PAQ dataset statistics and ODQA dataset answer coverage. “Ratio” refers to the number of generated questions which pass the global consistency filter.
Dataset . | Extracted Answers . | Unique Qs . | Filtered QAs . | Ratio . | Coverage . | |
---|---|---|---|---|---|---|
. | . | . | . | . | NQ . | TQA . |
PAQL,1 | 76.4M | 58.0M | 14.1M | 24.4% | 88.3 | 90.2 |
PAQL,4 | 76.4M | 225.2M | 53.8M | 23.9% | 89.8 | 90.9 |
PAQNE,1 | 122.2M | 65.4M | 12.0M | 18.6% | 83.5 | 88.3 |
PAQ | 165.7M | 279.2M | 64.9M | 23% | 90.2 | 91.1 |
Dataset . | Extracted Answers . | Unique Qs . | Filtered QAs . | Ratio . | Coverage . | |
---|---|---|---|---|---|---|
. | . | . | . | . | NQ . | TQA . |
PAQL,1 | 76.4M | 58.0M | 14.1M | 24.4% | 88.3 | 90.2 |
PAQL,4 | 76.4M | 225.2M | 53.8M | 23.9% | 89.8 | 90.9 |
PAQNE,1 | 122.2M | 65.4M | 12.0M | 18.6% | 83.5 | 88.3 |
PAQ | 165.7M | 279.2M | 64.9M | 23% | 90.2 | 91.1 |