Table 3: Characteristics of the DoCo dataset.

Characteristic                     Value
# PER entities                     250
# ORG entities                     270
# Relations
# Documents                        31,366
Doc. length range (words)          20 to 10,906
# Unique website domains           600
# Doc. with non-zero RE tuples     26,956
# Doc. with non-zero coverage      14,086
# Doc. in class informative        7,103 (22.6%)