An Approach to Lexical Development for Inflectional Languages
نویسندگان
چکیده
We describe a method for the semi-automatic development of morphological lexicons. The method aims at using minimal pre-existing resources and only relies upon the existence of a raw text corpus and a database of inflectional classes. No lexicon or list of base forms is assumed. The method is based on a contrastive approach, which generates hypothetical entries based on evidence drawn form a corpus, and selects the best candidates by heuristically comparing the candidate entries. The reliance upon inflectional information and the use of minimal resources make this approach particularly suitable for highly inflectional, lower-density languages. A prototype tool has been developed for Modern Greek.
منابع مشابه
Early Phonological and Lexical Development of a Farsi Speaking Child: A Longitudinal Case Study
The present study aims at the description and analysis of the phonological and lexical development of a child who is acquiring Farsi as his first language. The child's language production at the holophrastic stage of language development, mainly single words, is observed and recorded longitudinally for nearly seven months since he was 16 months old until he turned 23 months. An attempt is mad...
متن کاملExtracting Semantic Classes and Morphosyntactic Features for English-Polish Machine Translation
This paper describes a procedure aimed at automatic extraction of certain noun and verb categories from Polish texts. The general goal is to construct a lexical database that should be incorporated into a system for machine translation and multilingual generation of summaries. High quality processing of inflectional languages like Polish requires quite elaborated lexical entries, it is therefor...
متن کاملLexical Analysis of Agglutinative Languages Using a Dictionary of Lemmas and Lexical Transducers
This paper presents a simple method for performing a lexical analysis of agglutinative languages like Korean, which have a heavy morphology. Especially, for nouns and adverbs with regular morphological modifications and/or high productivity, we do not need to artificially construct huge dictionaries of all inflected forms of lemmas. To construct a dictionary of lemmas and lexical transducers, f...
متن کاملTerminology Acquisition and Description Using Lexical Resources and Local Grammars
Acquisition of new terminology from specific domains and its adequate description within terminological dictionaries is a complex task, especially for languages that are morphologically complex such as Serbian. In this paper we present an approach to solving this task semi-automatically on basis of lexical resources and local grammars developed for Serbian. Special attention is given to automat...
متن کاملSuppletion and dependency in inflectional morphology
The purpose of this paper is to present a general approach to verbal inflection with special emphasis on suppletion phenomena. The approach is applied to French in this paper, but it extends straightforwardly to other languages.1 The first part of the paper describes an analysis of suppletion in inflectional morphology with two design requirements. First, we attempt to provide an analysis which...
متن کامل