Exploiting Rules for Word Sense Disambiguation in Machine Translation

نویسندگان

  • Lucia Specia
  • Maria das Graças Volpe Nunes
  • Mark Stevenson
چکیده

This paper describes the automatic generation and the evaluation of sets of rules for word sense disambiguation (WSD) in machine translation. The ultimate aim is to identify high-quality rules that can be used as knowledge sources in a relational WSD model. The evaluation was carried out both automatically, by means of four objective measures (error, coverage, support and novelty), and manually, by means of a subjective analysis of the level of interest of the best rules as pointed out by the objective measures. As a result, we selected 63 rules addressing seven highly ambiguous verbs. The evaluation also evidenced which kinds of knowledge were effectively used by the WSD rules, which are not always the same as those revealed by traditional evaluations of complete WSD models. Although we experimented with English-Portuguese, the rule generation and evaluation procedures could be applied to any language pair, provided that there is a disambiguation sample corpus for that language pair.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Exploiting the Translation Context for Multilingual WSD

We propose a strategy to support Word Sense Disambiguation (WSD) which is designed specifically for multilingual applications, such as Machine Translation. Co-occurrence information extracted from the translation context, i.e., the set of words which have already been translated, is used to define the order in which disambiguation rules produced by a machine learning algorithm are applied. Expe...

متن کامل

A Word Sense Disambiguation Method Using Bilingual Corpus

This paper proposes a word sense disambiguation (WSD) method using bilingual corpus in English-Chinese machine translation system. A mathematical model is constructed to disambiguate word in terms of context phrasal collocation. A rules learning algorithm is proposed, and an application algorithm of the learned rules is also provided, which can increase the recall ratio. Finally, an analysis is...

متن کامل

Mining Rules for Word Sense Disambiguation

This paper describes the automatic generation and the evaluation of sets of rules for word sense disambiguation (WSD) in machine translation. The ultimate aim is to identify high-quality rules that can be used as knowledge sources in a relational WSD model. The evaluation was carried out both automatically, by means of four objective measures (error, coverage, support and novelty), and manually...

متن کامل

Towards A Hybrid Approach To Word-Sense Disambiguation In Machine Translation

The task of word sense disambiguation aims to select the correct sense of a polysemous word in a given context. When applied to machine translation, the correct translation in the target language must be selected for a polysemous lexical item in the source language. In this paper, we present work in progress on a supervised WSD system with a hybrid approach: on the one hand it relies on supervi...

متن کامل

Bootstrapping Phrase-based Statistical Machine Translation via WSD Integration

Beside the word order problem, word choice is another major obstacle for machine translation. Though phrase-based statistical machine translation (SMT) has an advantage of word choice based on local context, exploiting larger context is an interesting research topic. Recently, there have been a number of studies on integrating word sense disambiguation (WSD) into phrase-based SMT. The WSD score...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Procesamiento del Lenguaje Natural

دوره 35  شماره 

صفحات  -

تاریخ انتشار 2005