Table 4: 
An example of the CCG supertag predictions for the verb “use” from four different BERT variants. The correct answer is “((S[b]∖NP)/PP)/NP”, which both the R2L-KD and UG-KD models predict (blue). The No-KD baseline and the L2R-KD model, however, produce the same incorrect prediction (red): both fail to subcategorize for the prepositional phrase “as screens” as a dependent of the verb “use”. All four models predict the correct supertags for every other word in the sentence (not shown).
| Sentence Input | No-KD & L2R-KD Pred. | R2L-KD & UG-KD Pred. |
| --- | --- | --- |
| “Apple II owners , for example , had to use their TVsets as screens and stored data on audiocassettes” | (S[b]∖NP)/NP | ((S[b]∖NP)/PP)/NP |