Symmetry evaluations of metrics. SemBleu (left column) and Smatch (middle column) and Bleu as a ‘baseline’ in an MT task setting on newstest2018. SemBleu: large divergence, strong outliers. Smatch: few divergences, few outliers; Bleu: many small divergences, zero outliers. (a) marks the case in Figure 3.
This site uses cookies. By continuing to use our website, you are agreeing to our privacy policy. No content on this site may be used to train artificial intelligence systems without permission in writing from the MIT Press.