Table 1: 

Mention level F1 scores of models on CoNLL and OntoNotes, as well as on NRB and WTS.

ModelCoNLLOntoNotes
DevTestNRBWTSDevTestNRBWTS
Feature-based 
Flair-LSTM – 93.03 27.56 99.58 – 89.06 33.67 93.98 
ELMo-LSTM 96.69 92.47 31.65 98.24 88.31 89.38 34.34 94.90 
BERT-LSTM 95.94 91.94 38.34 98.08 86.12 87.28 43.07 92.04 
Fine-tuning 
BERT-base 96.18 92.19 75.54 98.67 87.23 88.19 75.34 94.22 
BERT-large 96.90 92.86 75.55 98.51 89.26 89.93 75.41 95.06 
ModelCoNLLOntoNotes
DevTestNRBWTSDevTestNRBWTS
Feature-based 
Flair-LSTM – 93.03 27.56 99.58 – 89.06 33.67 93.98 
ELMo-LSTM 96.69 92.47 31.65 98.24 88.31 89.38 34.34 94.90 
BERT-LSTM 95.94 91.94 38.34 98.08 86.12 87.28 43.07 92.04 
Fine-tuning 
BERT-base 96.18 92.19 75.54 98.67 87.23 88.19 75.34 94.22 
BERT-large 96.90 92.86 75.55 98.51 89.26 89.93 75.41 95.06 
Close Modal

or Create an Account

Close Modal
Close Modal