Rewrite probabilities learned for English, averaged over the last 4 epochs on πΎπ treebank (blue bars) or πΎπ_πΎππ treebank (orange bars). The header above each figure is the underlying punctuation string (input to NoisyChannel). The two counts in the figure headers are the number of occurrences of the underlying punctuation strings in the 1-best reconstruction of underlying punctuation sequences (by Algorithm 1) respectively in the πΎπ and πΎπ_πΎππ treebank. Each bar represents one surface punctuation string (output of NoisyChannel), its height giving the probability.
This site uses cookies. By continuing to use our website, you are agreeing to our privacy policy. No content on this site may be used to train artificial intelligence systems without permission in writing from the MIT Press.