Decision Lists for Lexical Ambiguity Resolution: Application to Accent Restoration in Spanish and French
Abstract
This paper presents a statistical decision procedure for lexical ambiguity resolution. The algorithm exploits both local syntactic patterns and more distant collocational evidence, generating an efficient, effective, and highly perspicuous recipe for resolving a given ambiguity. By identifying and utilizing only the single best disambiguating evidence in a target context, the algorithm avoids the problematic complex modeling of statistical dependencies. Although directly applicable to a wide class of ambiguities, the algorithm is described and evaluated in a realistic case study, the problem of restoring missing accents in Spanish and French text. Current accuracy exceeds 99% on the full task, and typically is over 90% for even the most difficult ambiguities.
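The idea of classifying with only the single best piece of evidence can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not say how evidence is ranked, so the smoothed log-likelihood ratio, the `alpha` smoothing constant, the feature set in `extract_features`, and the toy Spanish sentences below are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict


def extract_features(tokens, i, window=3):
    """Hypothetical evidence types: adjacent words (local pattern) and
    any word within +/-window positions (more distant collocation)."""
    feats = set()
    if i > 0:
        feats.add(("prev", tokens[i - 1]))
    if i + 1 < len(tokens):
        feats.add(("next", tokens[i + 1]))
    for j in range(max(0, i - window), min(len(tokens), i + window + 1)):
        if j != i:
            feats.add(("near", tokens[j]))
    return feats


def train_decision_list(examples, alpha=0.1):
    """examples: (tokens, target_index, label) triples with exactly two labels.
    Returns rules sorted by strength, plus a majority-label fallback."""
    label_counts = Counter(label for _, _, label in examples)
    a, b = sorted(label_counts)
    counts = defaultdict(Counter)
    for tokens, i, label in examples:
        for feat in extract_features(tokens, i):
            counts[feat][label] += 1
    rules = []
    for feat, c in counts.items():
        # Assumed ranking score: smoothed log-likelihood ratio of the two labels.
        score = math.log((c[a] + alpha) / (c[b] + alpha))
        if score != 0:
            rules.append((abs(score), feat, a if score > 0 else b))
    rules.sort(key=lambda rule: rule[0], reverse=True)
    default = label_counts.most_common(1)[0][0]
    return rules, default


def classify(rules, default, tokens, i):
    """Apply only the first (strongest) rule whose evidence is present."""
    feats = extract_features(tokens, i)
    for _, feat, label in rules:
        if feat in feats:
            return label
    return default


# Toy training data (invented): restoring the accent on Spanish "termino".
examples = [
    ("el termino de la guerra".split(), 1, "término"),
    ("un termino medio".split(), 1, "término"),
    ("ella termino el libro".split(), 1, "terminó"),
    ("cuando termino la clase".split(), 1, "terminó"),
]
rules, default = train_decision_list(examples)
print(classify(rules, default, "el termino medio".split(), 1))
print(classify(rules, default, "ella termino la clase".split(), 1))
```

Because `classify` stops at the first matching rule, no evidence is ever combined, which is how a list of this kind sidesteps modeling dependencies between features.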
Related resources
A Comparison of Corpus-based Techniques for Restoring Accents in Spanish and French Text
This paper will explore and compare three corpus-based techniques for lexical ambiguity resolution, focusing on the problem of restoring missing accents to Spanish and French text. Many of the ambiguities created by missing accents are differences in part of speech: hence one of the methods considered is an N-gram tagger using Viterbi decoding, such as is found in stochastic part-of-speech tagg...
Decision Lists for Lexical Ambiguity Resolution: Application
This paper presents a statistical decision procedure for lexical ambiguity resolution. The algorithm exploits both local syntactic patterns and more distant collocational evidence, generating an efficient, effective, and highly perspicuous recipe for resolving a given ambiguity. By identifying and utilizing only the single best disambiguating evidence in a target context, the algorithm avoids th...
Decision Lists for Lexical Ambiguity Resolution
This paper presents a statistical decision procedure for lexical ambiguity resolution. The algorithm exploits both local syntactic patterns and more distant collocational evidence, generating an efficient, effective, and highly perspicuous recipe for resolving a given ambiguity. By identifying and utilizing only the single best disambiguating evidence in a target context, the algorithm avoids th...
The Phonological Status of the French Initial Accent and its Role in Semantic Processing: An Event-Related Potentials Study
French accentuation is held to belong to the level of the phrase. Consequently French is considered ‘a language without accent’ with speakers that are ‘deaf to stress’. Recent ERP-studies investigating the French initial accent (IA) however demonstrate listeners not only discriminate between different stress patterns, but also prefer words to be marked with IA early in the process of speech com...
Investigating the phonological status of the Initial Accent in French: an Event-Related Potentials study
This Event-Related Potentials (ERP) study investigates the use of prosodic information in the process of lexical access in French. In French, accentuation is said to be post-lexical, with a primary final accent (FA) and secondary initial accent (IA) marking the edges of the phrase. Results from previous studies, however, suggest IA may hold a demarcative function close to the level of the word....