The ratio of top frequent input embeddings to the total occurrences of input embeddings in training data. Obviously, multiple features significantly alleviate the problem of sparse data.
Proportion (%) . | TOP 100 . | TOP 1K . | TOP 2K . |
---|---|---|---|
Character Only | 81.93 | 98.60 | 99.64 |
w/ Multi-Feature | 53.54 | 75.30 | 79.99 |
Proportion (%) . | TOP 100 . | TOP 1K . | TOP 2K . |
---|---|---|---|
Character Only | 81.93 | 98.60 | 99.64 |
w/ Multi-Feature | 53.54 | 75.30 | 79.99 |