Table 13 

Additional metrics for evaluating the performance of argument component identification, applied to the results of the 10-fold cross-validation scenario (Table 9). *Measured only on a subset of the data (see Section 4.4.6).

              Macro-F1   Krippendorff's αU   Boundary similarity
Human         0.60       0.48*               0.70
Baseline      0.16       0.11                0.18
Best system   0.25       0.30                0.32