Table 1 
Summary of parallel corpora extracted from EW and SEW. An original sentence can be aligned to one (1-to-1) or more (1-to-N) unique simplified sentences. A (*) indicates that some aligned simplified sentences may not be unique.
| Corpora | Instances | Alignment Types |
| --- | --- | --- |
| PWKP (Zhu, Bernhard, and Gurevych 2010) | 108K | 1-to-1, 1-to-N |
| C&K-1 (Coster and Kauchak 2011b) | 137K | 1-to-1, 1-to-N |
| RevisionWL (Woodsend and Lapata 2011a) | 15K | 1-to-1*, 1-to-N*, N-to-1* |
| AlignedWL (Woodsend and Lapata 2011a) | 142K | 1-to-1, 1-to-N |
| C&K-2 (Kauchak 2013) | 167K | 1-to-1, 1-to-N |
| EW-SEW (Hwang et al. 2015) | 392K | 1-to-1 |
| sscorpus (Kajiwara and Komachi 2016) | 493K | 1-to-1 |
| WikiLarge (Zhang and Lapata 2017) | 286K | 1-to-1*, 1-to-N*, N-to-1* |