Figure 1: 
Ratio of the held-out perplexity on a document completion task and the topic coherence as a function of the vocabulary size for the etm and lda on the 20NewsGroup corpus. The perplexity is normalized by the size of the vocabulary. While the performance of lda deteriorates for large vocabularies, the etm maintains good performance.

Ratio of the held-out perplexity on a document completion task and the topic coherence as a function of the vocabulary size for the etm and lda on the 20NewsGroup corpus. The perplexity is normalized by the size of the vocabulary. While the performance of lda deteriorates for large vocabularies, the etm maintains good performance.

Close Modal

or Create an Account

Close Modal
Close Modal