Skip to Main Content
Table 4 

Comparison of the PDTB and comparably annotated corpora. Count is the number of annotated relations; Coverage is the text genre(s) in the corpus; Mods=Y if connective modifiers are annotated. Impl=Y if implicit connectives are annotated. EntR=Y if Entity Relations are annotated. AltL=Y if Alternative Lexicalizations are annotated. Attr=Y if attribution is annotated. Supp=Y if arguments can have supplementary text. Sens=Y if senses have been annotated. Mult=Y if multiple sense relations can be annotated for a single connective.

NameCoverageCountModsImplEntRAltLAttrSuppSensMult
PDTB WSJ news, essays 40,600 
BioDRB Biomed papers 5,859 
LADTB Arabic news 6,328 N1 
Chinese DTB Xinhua news 3,951 Y2 
Turkish DB novels, news, etc. 8,484 
Hindi DRB news ∼5K 
PDT 3.0 news 20,542 Y3 
(PDiT 1.0)           
NameCoverageCountModsImplEntRAltLAttrSuppSensMult
PDTB WSJ news, essays 40,600 
BioDRB Biomed papers 5,859 
LADTB Arabic news 6,328 N1 
Chinese DTB Xinhua news 3,951 Y2 
Turkish DB novels, news, etc. 8,484 
Hindi DRB news ∼5K 
PDT 3.0 news 20,542 Y3 
(PDiT 1.0)           

1∼70% of adjacent sentences in the LADTB are linked by an explicit connective, compared with ∼12% in the PDTB.

2In 20 randomly selected files, over 80% of DRels were found to be implicit, compared with around 54.5% in the PDTB (Zhou and Xue 2012).

3Included in coreference annotation.

Close Modal

or Create an Account

Close Modal
Close Modal