The model of the factuality profiler put forward here has been implemented and evaluated against a corpus annotated for that purpose. The resulting tool, called De Facto, integrates the algorithm presented in the previous section with the linguistic resources, whose lexical and syntactic information is structured as presented in Section 3.4, and is articulated around the scalar definition of factuality values developed in Section 3.2. The approach is therefore entirely symbolic: it relies on lexical look-up performed while traversing the dependency tree of each sentence top-down. The lexical resources informing De Facto include those listed here. They will be made available to the community in the near future.
Polarity particles: A total of 11 negation particles distributed among adverbs (such as not, neither), determiners (no, non), and pronouns (none, nobody), together with the table on contextual polarity interactions (Table 2).
Modality particles: The set of 31 particles presented in Example (15), each accompanied by its default modality interpretation, together with their interaction table (Table 3).
ESPs: The lexical entries for a total of 646 ESPs, distributed as shown in Table 6. Lexical entries structure their factuality information as illustrated in Tables 4 and 5 (for SIPs and NSIPs, respectively). The information in each lexical entry was compiled manually in a data-driven fashion by exploring its use in our corpora of reference, TimeBank and the American National Corpus (Slate and NYTimes fragments).22
Table 6. Distribution of ESPs in De Facto.
Part of Speech | SIPs | NSIPs | Total |
---|---|---|---|
Verbs | 204 | 189 | 393 |
Nouns | 58 | 107 | 165 |
Adjectives | 27 | 61 | 88 |
Total | 289 | 357 | 646 |
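To make the idea of lexical look-up during a top-down traversal concrete, the following is a minimal sketch in Python and not De Facto's actual implementation. It assumes a hypothetical `Node` class for dependency trees and a small `NEGATION_PARTICLES` dictionary containing only the six negation particles named above (the full resource has 11). The function names `top_down` and `find_polarity_particles` are likewise illustrative. The sketch only locates polarity particles; it does not apply the contextual polarity interactions of Table 2 or compute any factuality value.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterator, List, Tuple

# Assumption: only the particles named in the text are listed here;
# the actual resource contains 11 negation particles.
NEGATION_PARTICLES: Dict[str, str] = {
    "not": "adverb",
    "neither": "adverb",
    "no": "determiner",
    "non": "determiner",
    "none": "pronoun",
    "nobody": "pronoun",
}


@dataclass
class Node:
    """Hypothetical dependency-tree node: a lemma plus its dependents."""
    lemma: str
    children: List["Node"] = field(default_factory=list)


def top_down(root: Node, depth: int = 0) -> Iterator[Tuple[int, Node]]:
    """Yield (depth, node) pairs, visiting each head before its dependents."""
    yield depth, root
    for child in root.children:
        yield from top_down(child, depth + 1)


def find_polarity_particles(root: Node) -> List[Tuple[int, str, str]]:
    """Look up every node in the negation lexicon during a top-down pass."""
    hits = []
    for depth, node in top_down(root):
        part_of_speech = NEGATION_PARTICLES.get(node.lemma.lower())
        if part_of_speech is not None:
            hits.append((depth, node.lemma, part_of_speech))
    return hits


# Hand-built toy tree for "Nobody said the deal was not signed"
# (an illustrative parse, not the output of any particular parser).
tree = Node("said", [
    Node("Nobody"),
    Node("signed", [Node("deal", [Node("the")]), Node("was"), Node("not")]),
])
hits = find_polarity_particles(tree)
```

Because heads are visited before their dependents, information found higher in the tree (here, the negative subject of the embedding verb) is available before the embedded event is reached, which is the property a top-down factuality computation would depend on.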