. | . | Evaluation (Test) Dataset . | |||||||
---|---|---|---|---|---|---|---|---|---|
Model . | Training Dataset . | . | . | . | . | ||||
. | . | EM . | F1 . | EM . | F1 . | EM . | F1 . | EM . | F1 . |
BiDAF | 56.7 0.5 | 70.1 0.3 | 11.61.0 | 21.31.1 | 8.60.6 | 17.30.8 | 8.30.7 | 16.80.5 | |
+ | 56.30.6 | 69.70.4 | 14.40.9 | 24.40.9 | 15.61.1 | 24.71.1 | 14.30.5 | 23.30.7 | |
+ | 56.20.6 | 69.40.6 | 14.40.7 | 24.20.8 | 15.70.6 | 25.10.6 | 13.90.8 | 22.70.8 | |
+ | 56.20.7 | 69.60.6 | 14.7 0.9 | 24.8 0.8 | 17.9 0.5 | 26.7 0.6 | 16.7 1.1 | 25.0 0.8 | |
BERT | 74.80.3 | 86.90.2 | 46.40.7 | 60.50.8 | 24.41.2 | 35.91.1 | 17.30.7 | 28.90.9 | |
+ | 75.20.4 | 87.2 0.2 | 52.40.9 | 66.50.9 | 40.91.3 | 51.21.5 | 32.90.9 | 44.10.8 | |
+ | 75.10.3 | 87.10.3 | 54.1 1.0 | 68.0 0.8 | 43.71.1 | 54.11.3 | 34.70.7 | 45.70.8 | |
+ | 75.3 0.4 | 87.10.3 | 53.01.1 | 67.10.8 | 44.1 1.1 | 54.4 0.9 | 36.6 0.8 | 47.8 0.5 | |
RoBERTa | 73.20.4 | 86.30.2 | 48.91.1 | 64.31.1 | 31.31.1 | 43.51.2 | 16.10.8 | 26.70.9 | |
+ | 73.9 0.4 | 86.7 0.2 | 55.01.4 | 69.70.9 | 46.51.1 | 57.31.1 | 31.90.8 | 42.41.0 | |
+ | 73.80.2 | 86.7 0.2 | 55.41.0 | 70.10.9 | 48.91.0 | 59.01.2 | 32.91.3 | 43.71.4 | |
+ | 73.50.3 | 86.50.2 | 55.9 0.7 | 70.6 0.7 | 49.1 1.2 | 59.5 1.2 | 34.7 1.0 | 45.9 1.2 |
. | . | Evaluation (Test) Dataset . | |||||||
---|---|---|---|---|---|---|---|---|---|
Model . | Training Dataset . | . | . | . | . | ||||
. | . | EM . | F1 . | EM . | F1 . | EM . | F1 . | EM . | F1 . |
BiDAF | 56.7 0.5 | 70.1 0.3 | 11.61.0 | 21.31.1 | 8.60.6 | 17.30.8 | 8.30.7 | 16.80.5 | |
+ | 56.30.6 | 69.70.4 | 14.40.9 | 24.40.9 | 15.61.1 | 24.71.1 | 14.30.5 | 23.30.7 | |
+ | 56.20.6 | 69.40.6 | 14.40.7 | 24.20.8 | 15.70.6 | 25.10.6 | 13.90.8 | 22.70.8 | |
+ | 56.20.7 | 69.60.6 | 14.7 0.9 | 24.8 0.8 | 17.9 0.5 | 26.7 0.6 | 16.7 1.1 | 25.0 0.8 | |
BERT | 74.80.3 | 86.90.2 | 46.40.7 | 60.50.8 | 24.41.2 | 35.91.1 | 17.30.7 | 28.90.9 | |
+ | 75.20.4 | 87.2 0.2 | 52.40.9 | 66.50.9 | 40.91.3 | 51.21.5 | 32.90.9 | 44.10.8 | |
+ | 75.10.3 | 87.10.3 | 54.1 1.0 | 68.0 0.8 | 43.71.1 | 54.11.3 | 34.70.7 | 45.70.8 | |
+ | 75.3 0.4 | 87.10.3 | 53.01.1 | 67.10.8 | 44.1 1.1 | 54.4 0.9 | 36.6 0.8 | 47.8 0.5 | |
RoBERTa | 73.20.4 | 86.30.2 | 48.91.1 | 64.31.1 | 31.31.1 | 43.51.2 | 16.10.8 | 26.70.9 | |
+ | 73.9 0.4 | 86.7 0.2 | 55.01.4 | 69.70.9 | 46.51.1 | 57.31.1 | 31.90.8 | 42.41.0 | |
+ | 73.80.2 | 86.7 0.2 | 55.41.0 | 70.10.9 | 48.91.0 | 59.01.2 | 32.91.3 | 43.71.4 | |
+ | 73.50.3 | 86.50.2 | 55.9 0.7 | 70.6 0.7 | 49.1 1.2 | 59.5 1.2 | 34.7 1.0 | 45.9 1.2 |