The Lefff, a Freely Available and Large-coverage Morphological and Syntactic Lexicon for French
Abstract
In this paper, we introduce the Lefff, a freely available, accurate and large-coverage morphological and syntactic lexicon for French, used in many NLP tools such as large-coverage parsers. We first describe Alexina, the lexical framework in which the Lefff is developed, as well as the linguistic notions and formalisms it is based on. Next, we describe the various sources of lexical data we used for building the Lefff, in particular semi-automatic lexical development techniques and conversion and merging of existing resources. Finally, we illustrate the coverage and precision of the resource by comparing it with other resources and by assessing its impact in various NLP tools.
Similar resources
Building a morphological and syntactic lexicon by merging various linguistic resources
This paper shows how large-coverage morphological and syntactic NLP lexicons can be developed by interpreting existing lexical resources, converting them to a common format and merging them. Applied to Spanish, this allowed us to build a morphological and syntactic lexicon, the Leffe. It relies on the Alexina framework, originally developed together with the French lexicon Lefff. We describe how the inpu...
Using Lexicon-Grammar Tables for French Verbs in a Large-Coverage Parser
In this paper, we describe the integration of Lexicon-Grammar tables for French verbs in the large-coverage FRMG parser and the evaluation of the resulting parser. This integration required a conversion step so as to extract the syntactic information encoded in Lexicon-Grammar tables and represent it in the NLP lexical formalism used by FRMG, i.e., the Alexina framework (that of the Lefff lexic...
Mining Parsing Results for Lexical Correction: Toward a Complete Correction Process of Wide-Coverage Lexicons
The coverage of a parser depends mostly on the quality of the underlying grammar and lexicon. Developing a lexicon that is both complete and accurate is an intricate and demanding task. We introduce an automatic process for detecting missing, incomplete and erroneous entries in a morphological and syntactic lexicon, and for suggesting correction hypotheses for these entries. The detection of du...
Extending the adverbial coverage of a French morphological lexicon
We present an extension of the adverbial entries of the French morphological lexicon DELA (Dictionnaires Electroniques du LADL / LADL electronic dictionaries). Adverbs were extracted from LGLex, an NLP-oriented syntactic resource for French, which in turn contains all adverbs extracted from the Lexicon-Grammar tables of both simple adverbs ending in -ment (i.e., ’-ly’) (Molinier and Levrier,...
A Morphological Lexicon for the Persian Language
We introduce PerLex, a large-coverage and freely available morphological lexicon for the Persian language. We describe the main features of Persian morphology, and the way we have represented it within the Alexina formalism, on which PerLex is based. We focus on the methodology we used for constructing lexical entries from various sources, as well as the problems related to typographic norm...