Improving the Representation of Word-Formation in Multilingual Lexicographic Tools: the MuLeXFoR Database

Authors

  • Bruno Cartoni
  • Marie-Aude Lefer
Abstract

This paper introduces a new lexicographic resource, MuLeXFoR, which aims to present word-formation processes in a multilingual database designed both for language specialists (e.g. linguists, terminologists, lexicographers, NLP specialists) and for second-language (L2) learners and trainee translators. Morphological items (e.g. affixes, compound parts, combining forms) and processes (prefixation, suffixation, compounding, conversion, etc.) pose major challenges for lexicographic work, especially with respect to the design of bilingual and multilingual resources. It is well known that derivational affixes can take part in several word-formation rules and that, conversely, rules can be realised by means of a variety of affixes. In view of this complexity, it is often difficult to provide enough information to help users (1) understand the meaning(s) of an affix and the (near-)synonymy relations between affixes and (2) become familiar with the most frequent strategies used to translate the meaning(s) conveyed by these affixes. In fact, traditional dictionaries often fail to achieve this goal. The MuLeXFoR database tries to take advantage of recent advances in morphological description and the development of electronic multi-access database systems. The database relies on the lexematic approach to word-formation, which is especially helpful for representing morphological processes cross-linguistically. In addition, it has been entirely implemented in a multi-access database interface. The prototype described in this paper so far centres on prefixation in English, French and Italian. Two interfaces are currently available: a comprehensive interface aimed at morphological and lexicographic investigations by language specialists (MuLeXFoR-Linguists) and a second interface designed for second-language learners or trainee translators (MuLeXFoR-Learners).
Section 1 first briefly discusses the ways in which word-formation processes are currently presented in dictionaries and brings to light some of the shortcomings involved in these procedures. Section 2 then describes the theoretical approach that has been adopted to formalise word-formation processes from a multilingual perspective, viz. the lexematic approach. Section 3 introduces the MuLeXFoR database and provides a detailed description of the ways users can browse the tool. Section 4 presents the ways in which the database was adapted to second-language learners, mainly by simplifying labels and menu names and by adding information specific to the production of new words in L2. Finally, Sections 5 and 6 deal with the various implementation and data collection issues raised by our approach. The paper ends with some concluding remarks in Section 7.
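The many-to-many relation between affixes and word-formation rules mentioned in the abstract can be illustrated with a minimal sketch. This is not the MuLeXFoR schema: the rule labels, the sample entries, and the function names `build_indexes` and `equivalents` are assumptions made purely for illustration. The sketch indexes a handful of prefixes both by affix and by rule, so that a prefix in one language can be mapped to the prefixes realising the same rule in another.

```python
from collections import defaultdict

# Hypothetical sample entries (language, prefix, rule label).
# Rule labels are illustrative, not taken from the MuLeXFoR database.
ENTRIES = [
    ("en", "un-", "negation"),
    ("en", "un-", "reversal"),
    ("en", "non-", "negation"),
    ("en", "re-", "repetition"),
    ("fr", "in-", "negation"),
    ("fr", "non-", "negation"),
    ("fr", "dé-", "reversal"),
    ("fr", "re-", "repetition"),
    ("it", "in-", "negation"),
    ("it", "ri-", "repetition"),
]


def build_indexes(entries):
    """Index the entries in both directions: affix -> rules, rule -> affixes."""
    by_affix = defaultdict(set)
    by_rule = defaultdict(set)
    for lang, prefix, rule in entries:
        by_affix[(lang, prefix)].add(rule)
        by_rule[rule].add((lang, prefix))
    return by_affix, by_rule


def equivalents(by_affix, by_rule, lang, prefix, target_lang):
    """For each rule a prefix takes part in, list target-language prefixes
    that realise the same rule."""
    result = {}
    for rule in by_affix.get((lang, prefix), ()):
        result[rule] = sorted(
            p for (l, p) in by_rule[rule] if l == target_lang
        )
    return result


BY_AFFIX, BY_RULE = build_indexes(ENTRIES)

if __name__ == "__main__":
    # English "un-" takes part in two rules; each maps to French prefixes.
    print(equivalents(BY_AFFIX, BY_RULE, "en", "un-", "fr"))
```

Indexing by rule rather than by affix form is what allows one prefix to be linked to several candidate translations, one set per meaning.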

Similar resources

The MuLeXFoR Database: Representing Word-Formation Processes in a Multilingual Lexicographic Environment

This paper introduces a new lexicographic resource, the MuLeXFoR database, which aims to present word-formation processes in a multilingual environment. Morphological items represent a real challenge for lexicography, especially for the development of multilingual tools. The database introduced in this paper tries to take advantage of recent advances in electronic implementation and morphologic...

From Resources to Applications. Designing the Multilingual ISLE Lexical Entry

The ISLE Computational Lexicon Working Group is committed to the consensual definition of a standardized infrastructure to develop multilingual resources for HLT applications. In particular, the ISLE-CLWG pursues this goal by designing MILE (Multilingual ISLE Lexical Entry), a general schema for the encoding of multilingual lexical information. This has to be intended as a meta-entry, acting as...

Multilingual Information Processing on Relational Database Architectures

Efficient storage and query processing of data spanning multiple natural languages are of crucial importance in today's globalized world. A primary prerequisite to achieve this goal is that the principal data repositories, relational database systems, should efficiently and seamlessly support multilingual data. Our survey of current relational systems indicates that while they do support storage an...

Optimization of 241Am-Be emission direction in neutron porosity tools for improving the precision in determining the porosity in calcite formation

Investigating hydrocarbon reservoirs is important, so it is essential to predict and explore them precisely. One method is well logging, in which a probe or tool is lowered into the well to measure one or more characteristics. Nuclear well logging involves a radioisotope source and at least one detector. In this work, emission direction of neutrons from the 241Am-Be neutron source toward...

Word Type Effects on L2 Word Retrieval and Learning: Homonym versus Synonym Vocabulary Instruction

The purpose of this study was twofold: (a) to assess the retention of two word types (synonyms and homonyms) in short-term memory, and (b) to investigate the effect of these word types on word learning by asking learners to learn their Persian meanings. A total of 73 Iranian language learners studying English translation participated in the study. For the first purpose, 36 freshmen from an ...

Publication date: 2010