Skip to Main Content
Table 3: 
Development results of the variations on Graph Transformer. Unlisted values are identical to those of the base model. Both models trained on LDC2015E86 and LDC2017T10 are evaluated.
L1L2dmodeldhPdropBLEUBLEU
(LDC2015E86)(LDC2017T10)
base 256 0.3 25.5 28.8 
 
(A)     20.4 24.6 
    23.7 27.6 
    24.3 28.3 
10     24.6 28.4 
 
(B)     23.4 27.1 
    24.6 28.0 
     24.8 28.7 
 
(C)   512   25.1 28.5 
 
(D)     23.6 28.4 
    23.0 28.7 
 
(E)     0.1 22.7 26.9 
    0.2 25.3 28.5 
     0.4 24.7 27.9 
 
(F) single representation for each node 25.1 28.3 
single representation, inseparate graph attention 23.1 27.6 
L1L2dmodeldhPdropBLEUBLEU
(LDC2015E86)(LDC2017T10)
base 256 0.3 25.5 28.8 
 
(A)     20.4 24.6 
    23.7 27.6 
    24.3 28.3 
10     24.6 28.4 
 
(B)     23.4 27.1 
    24.6 28.0 
     24.8 28.7 
 
(C)   512   25.1 28.5 
 
(D)     23.6 28.4 
    23.0 28.7 
 
(E)     0.1 22.7 26.9 
    0.2 25.3 28.5 
     0.4 24.7 27.9 
 
(F) single representation for each node 25.1 28.3 
single representation, inseparate graph attention 23.1 27.6 
Close Modal

or Create an Account

Close Modal
Close Modal