Ratio of the held-out perplexity on a document completion task and the topic coherence as a function of the vocabulary size for the etm and lda on the 20NewsGroup corpus. The perplexity is normalized by the size of the vocabulary. While the performance of lda deteriorates for large vocabularies, the etm maintains good performance.
This site uses cookies. By continuing to use our website, you are agreeing to our privacy policy. No content on this site may be used to train artificial intelligence systems without permission in writing from the MIT Press.