The five baselines. The subscript indicates the parameters found on IN2EN-dev and used for IN2EN-test. The scores that are statistically significantly better than ML2EN and IN2EN (p < 0.01, Collins' sign test) are shown in bold and are underlined, respectively.