F1-scores (%) obtained on the coverage prediction task by various heuristics and methods.
Method | PER: member-of | PER: family | PER: edu-at | PER: position-held | ORG: partner-org | ORG: founded-by | ORG: ceo | ORG: board-member | Avg. |
---|---|---|---|---|---|---|---|---|---|
Random (biased) | 5.7 | 6.8 | 4.9 | 10.0 | 7.5 | 1.2 | 13.5 | 3.7 | 6.6 |
Random (fair) | 15.7 | 11.1 | 12.6 | 15.4 | 15.2 | 8.9 | 21.3 | 7.2 | 13.4 |
Text Complexity | 9.6 | 5.4 | 6.1 | 10.3 | 3.5 | 3.3 | 15.0 | 5.4 | 7.3 |
Alexa Ranking | 12.6 | 9.8 | 8.1 | 12.4 | 16.7 | 11.3 | 24.8 | 7.3 | 12.9 |
Entity Saliency | 17.8 | 14.3 | 11.9 | 18.2 | 14.7 | 8.4 | 24.6 | 7.1 | 14.6 |
Document Length | 20.5 | 19.0 | 15.5 | 21.9 | 23.9 | 12.8 | 28.8 | 8.5 | 18.9 |
NER Count | 24.3 | 19.8 | 18.2 | – | 21.1 | 13.7 | 34.5 | 11.8 | 20.5 |
BM25 IR | 27.1 | 21.1 | 18.8 | 26.3 | 21.8 | 12.9 | 36.6 | 12.1 | 22.1 |
T5 IR | 26.9 | 23.2 | 20.3 | 29.6 | 19.5 | 15.4 | 41.1 | 13.1 | 23.6 |
LDA Topic Model | 19.3 | 19.0 | 14.5 | 21.1 | 15.7 | 8.6 | 25.2 | 11.5 | 16.9 |
GloVe+LSTM | 16.5 | 28.6 | 19.8 | 32.9 | 24.2 | 19.5 | 24.4 | 4.9 | 21.3 |
Ngrams+TFIDF | 36.2 | 40.0 | 25.6 | 40.2 | 18.6 | 25.5 | 41.8 | 30.2 | 32.3 |
BOW+TFIDF | 36.0 | 41.0 | 29.2 | 42.1 | 17.2 | 28.3 | 40.6 | 32.1 | 33.3 |
BERT | 40.4 | 39.7 | 35.7 | 44.4 | 22.0 | 30.8 | 43.0 | 33.8 | 36.2 |
Heu+TFIDF | 41.9 | 43.5 | 31.3 | 36.5 | 35.1 | 28.2 | 41.4 | 22.0 | 35.0 |
HERB | 44.2 | 41.7 | 40.5 | 45.6 | 28.8 | 32.5 | 46.2 | 34.8 | 39.3 |
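
The Avg. column appears to be the unweighted mean of the eight per-relation F1-scores; this is an inference from the numbers, not something the table states. The Python sketch below recomputes it for three rows copied from the table and compares against the reported value. The names `rows`, `macro_average`, and `TOLERANCE` are illustrative only. A small tolerance is assumed because the published averages may have been computed from unrounded scores.

```python
# Per-relation F1-scores (%) copied from the table, in column order:
# member-of, family, edu-at, position-held, partner-org, founded-by, ceo, board-member.
# The last element of each tuple is the Avg. value reported in the table.
rows = {
    "BERT": ([40.4, 39.7, 35.7, 44.4, 22.0, 30.8, 43.0, 33.8], 36.2),
    "Heu+TFIDF": ([41.9, 43.5, 31.3, 36.5, 35.1, 28.2, 41.4, 22.0], 35.0),
    "HERB": ([44.2, 41.7, 40.5, 45.6, 28.8, 32.5, 46.2, 34.8], 39.3),
}

# Assumed tolerance (in F1 points) to absorb rounding of the published scores.
TOLERANCE = 0.1


def macro_average(scores):
    """Unweighted mean over relations (assumed definition of the Avg. column)."""
    return sum(scores) / len(scores)


for method, (scores, reported) in rows.items():
    recomputed = macro_average(scores)
    consistent = abs(recomputed - reported) <= TOLERANCE
    print(f"{method}: recomputed={recomputed:.2f} reported={reported} consistent={consistent}")
```

The same check cannot be run for the NER Count row, because its position-held cell is missing in the table.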