Semi-automated Extraction of a Wide-Coverage Type-Logical Grammar for French
نویسندگان
چکیده
The paper describes the development of a wide-coverage type-logical grammar for French, which has been extracted from the Paris 7 treebank and received a significant amount of manual verification and cleanup. The resulting treebank is evaluated using a supertagger and performs at a level comparable to the best supertagging results for English. Résumé. Cet article décrit le développement d’une grammaire catégorielle à large couverture du Français, extraite à partir du corpus arboré de Paris 7 et vérifiée et corrigée manuellement. Le grammaire catégorielle résultant est évaluée en utilisant un supertagger et obtient des résultats comparables aux meilleurs supertaggers pour l’Anglais. Mots-clés : Extraction de grammaires, grammaires catégorielles, supertagging.
منابع مشابه
Wide-Coverage Grammar Extraction from Thai Treebank
Parsing is an important step for natural language understanding, including phrase alignment for supporting statistical machine translation. Ability on analysing real text by parser strongly depends on grammar. Treebank could be one of the sources for grammar extraction. However, treebank construction largely relies on human annotators intuitions. Different intuitions from multiple annotators br...
متن کاملBuilding Deep Dependency Structures with a Wide-Coverage CCG Parser
This paper describes a wide-coverage statistical parser that uses Combinatory Categorial Grammar (CCG) to derive dependency structures. The parser differs from most existing wide-coverage treebank parsers in capturing the long-range dependencies inherent in constructions such as coordination, extraction, raising and control, as well as the standard local predicate-argument dependencies. A set o...
متن کاملAutomating the Generation of a Wide-coverage LFG for French using a MetaGrammar
In this paper, we explain how the notion of MetaGrammar, which has successfully been used for generating wide-coverage tree adjoining grammars (TAGs) for various languages such as French (Abeillé et al. (1999)) and German (Gerdes (2002)), may be used to generate a wide-coverage Lexical Functional Grammar (LFG) for French. We first introduce the notion of MetaGrammar and present the tools we use...
متن کاملBuilding Deep Dependency Structures using a Wide-Coverage CCG Parser
This paper describes a wide-coverage statistical parser that uses Combinatory Categorial Grammar (CCG) to derive dependency structures. The parser differs from most existing wide-coverage treebank parsers in capturing the long-range dependencies inherent in constructions such as coordination, extraction, raising and control, as well as the standard local predicate-argument dependencies. A set o...
متن کاملFTAG : developping and maintaining a wide - coverage grammar for French
We describe the current status and organization of a French Lexicalized Tree Adjoining Grammar (FTAG), developed over the last 10 years at Paris 7 (Abeillé 91, Abeillé & al. 99). The new version of the grammar is generated semi-automatically, independently of any corpus or application domain. It is intended to model speakers' competence, and can be used both for parsing and generation. As far a...
متن کامل