A summary of the different components of the AOC data set. Overall, 1.4M comments were harvested from 86.1K articles, corresponding to 52.1M words.
News Source . | Al-Ghad . | Al-Riyadh . | Al-Youm Al-Sabe' . | ALL . |
---|---|---|---|---|
# articles | 6.30K | 34.2K | 45.7K | 86.1K |
# comments | 26.6K | 805K | 565K | 1.4M |
# sentences | 63.3K | 1,686K | 1,384K | 3.1M |
# words | 1.24M | 18.8M | 32.1M | 52.1M |
comments/article | 4.23 | 23.56 | 12.37 | 16.21 |
sentences/comment | 2.38 | 2.09 | 2.45 | 2.24 |
words/sentence | 19.51 | 11.14 | 23.22 | 16.65 |
News Source . | Al-Ghad . | Al-Riyadh . | Al-Youm Al-Sabe' . | ALL . |
---|---|---|---|---|
# articles | 6.30K | 34.2K | 45.7K | 86.1K |
# comments | 26.6K | 805K | 565K | 1.4M |
# sentences | 63.3K | 1,686K | 1,384K | 3.1M |
# words | 1.24M | 18.8M | 32.1M | 52.1M |
comments/article | 4.23 | 23.56 | 12.37 | 16.21 |
sentences/comment | 2.38 | 2.09 | 2.45 | 2.24 |
words/sentence | 19.51 | 11.14 | 23.22 | 16.65 |