. | No-KD . | UG-KD . |
---|---|---|
Validation Set (Per-task average / 1-best random seed) | ||
CoLA | 50.7 / 60.2 | 54.3 / 60.6 |
7-task avg. (excl. CoLA) | 85.4 / 87.8 | 84.8 / 86.9 |
Overall 8-task avg. | 81.1 / 84.4 | 81.0 / 83.6 |
Test set (Per-task 1-best random seed on validation set) | ||
CoLA | 53.1 | 55.3 |
7-task avg. (excl. CoLA) | 84.2 | 83.5 |
Overall 8-task avg. | 80.3 | 80.0 |
. | No-KD . | UG-KD . |
---|---|---|
Validation Set (Per-task average / 1-best random seed) | ||
CoLA | 50.7 / 60.2 | 54.3 / 60.6 |
7-task avg. (excl. CoLA) | 85.4 / 87.8 | 84.8 / 86.9 |
Overall 8-task avg. | 81.1 / 84.4 | 81.0 / 83.6 |
Test set (Per-task 1-best random seed on validation set) | ||
CoLA | 53.1 | 55.3 |
7-task avg. (excl. CoLA) | 84.2 | 83.5 |
Overall 8-task avg. | 80.3 | 80.0 |