F1-scores and recall of modules.
. | . | Dataset (Classes, N) . | Accuracy . |
---|---|---|---|
Textual Entailment (R1–R5) | 1 | MNLI (ent/con/neu, 412,349) | F1=82.3 |
2 | AntSyn (ent/con, 15,632) | F1=90.2 | |
3 | Neu50K (neu, 50,000) | R=97.5 | |
4 | MicroAvg (ent/con/neu, 477,981) | F1=84.7 | |
Sentiment Classification (R4–R5) | 5 | SemEval17 (pos/neg/neu, 20,632) | F1=64.5 |
6 | Dong (pos/neg/neu, 6,940) | F1=71.4 | |
7 | Mitchell (pos/neg/neu, 3,288) | F1=62.5 | |
8 | Bakliwal (pos/neg/neu, 2,624) | F1=69.7 | |
9 | Norm (pos/neg, 632) | F1=100.0 | |
10 | MicroAvg (pos/neg/neu, 34,116) | F1=69.2 | |
Causality (R6–R9) | 11 | PDTB (cause/else, 14,224) | F1=68.1 |
12 | PDTB-R (cause/else 1,791) | F1=75.7 | |
13 | BECauSE (cause/obstruct, 1,542) | F1=46.1 | |
14 | BECauSE-R (else, 1,542) | R=86.5 | |
15 | CoNet (cause, 50,420) | R=88.6 | |
16 | CoNet-R (else, 50,420) | R=91.7 | |
17 | WIQA (cause/obstruct, 31,630) | F1=88.2 | |
18 | WIQA-P (else, 31,630) | R=90.2 | |
19 | MicroAvg (cause/obstr/else, 183,119) | F1=87.7 | |
Normative Relation (R10–R13) | 20 | JustType (conseq/norm, 1,580) | F1=90.2 |
21 | ConseqSenti (pos/neg, 824) | F1=71.8 | |
22 | NormType (adv/opp, 758) | F1=91.1 | |
23 | RC-Rel (consist/contra/else, 1,924) | F1=70.1 |
. | . | Dataset (Classes, N) . | Accuracy . |
---|---|---|---|
Textual Entailment (R1–R5) | 1 | MNLI (ent/con/neu, 412,349) | F1=82.3 |
2 | AntSyn (ent/con, 15,632) | F1=90.2 | |
3 | Neu50K (neu, 50,000) | R=97.5 | |
4 | MicroAvg (ent/con/neu, 477,981) | F1=84.7 | |
Sentiment Classification (R4–R5) | 5 | SemEval17 (pos/neg/neu, 20,632) | F1=64.5 |
6 | Dong (pos/neg/neu, 6,940) | F1=71.4 | |
7 | Mitchell (pos/neg/neu, 3,288) | F1=62.5 | |
8 | Bakliwal (pos/neg/neu, 2,624) | F1=69.7 | |
9 | Norm (pos/neg, 632) | F1=100.0 | |
10 | MicroAvg (pos/neg/neu, 34,116) | F1=69.2 | |
Causality (R6–R9) | 11 | PDTB (cause/else, 14,224) | F1=68.1 |
12 | PDTB-R (cause/else 1,791) | F1=75.7 | |
13 | BECauSE (cause/obstruct, 1,542) | F1=46.1 | |
14 | BECauSE-R (else, 1,542) | R=86.5 | |
15 | CoNet (cause, 50,420) | R=88.6 | |
16 | CoNet-R (else, 50,420) | R=91.7 | |
17 | WIQA (cause/obstruct, 31,630) | F1=88.2 | |
18 | WIQA-P (else, 31,630) | R=90.2 | |
19 | MicroAvg (cause/obstr/else, 183,119) | F1=87.7 | |
Normative Relation (R10–R13) | 20 | JustType (conseq/norm, 1,580) | F1=90.2 |
21 | ConseqSenti (pos/neg, 824) | F1=71.8 | |
22 | NormType (adv/opp, 758) | F1=91.1 | |
23 | RC-Rel (consist/contra/else, 1,924) | F1=70.1 |