Skip to Main Content
Table 3: 
The purity, homogeneity, and variation of information (VI) scores for the latent discourse roles measured against the human-annotated dialogue acts. For purity and homogeneity, higher scores indicate better performance, while for VI scores, lower is better. In each column, the best results are in boldface. Our joint model Topic+Disc significantly outperforms all the baselines ( p < 0.01, paired t-test).
ModelsPurityHomogeneityVI
Baselines    
LAED 0.505 0.022 6.418 
Li et al. (2018) 0.511 0.096 5.540 
Our models    
Disc only 0.510 0.112 5.532 
Topic+Disc 0.521 0.142 5.097 
ModelsPurityHomogeneityVI
Baselines    
LAED 0.505 0.022 6.418 
Li et al. (2018) 0.511 0.096 5.540 
Our models    
Disc only 0.510 0.112 5.532 
Topic+Disc 0.521 0.142 5.097 
Close Modal

or Create an Account

Close Modal
Close Modal