Table A.1 

Top 10 source domains in the raw corpus.

Domain                      Docs    Domain                          Docs
living.msn.com              2040    www.theage.com.au                196
discussion.theguardian.com   494    www.forerunner.com               169
community.babycenter.com     403    www.netmums.com                  117
www.washingtonpost.com       398    schoolsofthought.blogs.cnn.com   117
www.cbc.ca                   380    www.greatschools.org              89