Linguistic resources required by the factored lexicon. Equivalent resources for Arabic and French do not presently exist: the ATB lacks gold lemmas, and a French morphological ranker equivalent to MADA (which can produce the full set of morphosyntactic features specified in the ATB) has not been developed. Morfette is effectively a discriminative classifier that treats analyses as atomic labels, whereas MADA uses a morphological generator.
Resource | Arabic (ATB) | French (FTB) |
---|---|---|
Gold Morphological Features | Gender, Number, Tense, Person, Mood, Voice, Definiteness | Gender, Number, Tense, Person |
Gold Lemmas | × | ✓ |
Morphological Analyzer | ✓ (SAMA) | × |
Morphological Ranker | ✓ (MADA) | ✓ (Morfette) |
Lemmatizer | ✓ (MADA) | ✓ (Morfette) |