Decision Lists for Lexical Ambiguity Resolution: Application to Accent Restoration in Spanish and French

نویسنده

  • David Yarowsky
چکیده

This paper presents a statistical decision procedure for lexical ambiguity resolution. The algorithm exploits both local syntactic patterns and more distant collocational evidence, generating an efficient, effective, and highly perspicuous recipe for resolving a given ambiguity. By identifying and utilizing only the single best disambiguating evidence in a target context, the algorithm avoids the problematic complex modeling of statistical dependencies. Although directly applicable to a wide class of ambiguities, the algorithm is described and evaluated in a realistic case study, the problem of restoring missing accents in Spanish and French text. Current accuracy exceeds 99% on the full task, and typically is over 90% for even the most difficult ambiguities.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Comparison of Corpus-based Techniques for Restoring Accents in Spanish and French Text

This paper will explore and compare three corpus-based techniques for lexical ambiguity resolution, focusing on the problem of restoring missing accents to Spanish and French text. Many of the ambiguities created by missing accents are differences in part of speech: hence one of the methods considered is an N-gram tagger using Viterbi decoding, such as is found in stochastic part-of-speech tagg...

متن کامل

DECISION LISTS FOR LEXICAL AMBIGUITYRESOLUTION : Application

This paper presents a statistical decision procedure for lexical ambiguity resolution. The algorithm exploits both local syntactic patterns and more distant collo-cational evidence, generating an eecient, eeective, and highly perspicuous recipe for resolving a given ambiguity. By identifying and utilizing only the single best dis-ambiguating evidence in a target context, the algorithm avoids th...

متن کامل

Decision Lists for Lexical Ambiguityresolution

This paper presents a statistical decision procedure for lexical ambiguity resolution. The algorithm exploits both local syntactic patterns and more distant collo-cational evidence, generating an eecient, eeective, and highly perspicuous recipe for resolving a given ambiguity. By identifying and utilizing only the single best dis-ambiguating evidence in a target context, the algorithm avoids th...

متن کامل

The Phonological Status of the French Initial Accent and its Role in Semantic Processing: An Event-Related Potentials Study

French accentuation is held to belong to the level of the phrase. Consequently French is considered ‘a language without accent’ with speakers that are ‘deaf to stress’. Recent ERP-studies investigating the French initial accent (IA) however demonstrate listeners not only discriminate between different stress patterns, but also prefer words to be marked with IA early in the process of speech com...

متن کامل

Investigating the phonological status of the Initial Accent in French: an Event-Related Potentials study

This Event-Related Potentials (ERP) study investigates the use of prosodic information in the process of lexical access in French. In French, accentuation is said to be post-lexical, with a primary final accent (FA) and secondary initial accent (IA) marking the edges of the phrase. Results from previous studies, however, suggest IA may hold a demarcative function close to the level of the word....

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994