Experimental results compared to the Nissim (top) and Rahman (bottom) baselines. Bolded scores indicate significant improvements relative to all other models (p < 0.01).