Accuracy on HANS when a small fraction of training data is removed. Removing non-entailment examples with high premise-hypothesis overlap significantly hurt performance compared with removing examples uniformly at random.
This site uses cookies. By continuing to use our website, you are agreeing to our privacy policy. No content on this site may be used to train artificial intelligence systems without permission in writing from the MIT Press.