Improving Disambiguation of Prepositional Phrase Attachments Using the Web as Corpus*

نویسندگان

  • Hiram Calvo
  • Alexander Gelbukh
چکیده

The problem of disambiguating Prepositional Phrase (PP) Attachments consists in determining if a PP is part of a Noun Phrase, as in He sees the room with books, or an argument of a verb, as in He fills the room with books. Volk has proposed two variants of a method that queries an Internet search engine to find the most probable Prepositional Phrase attachment. In this paper we apply the latest variant of Volk’s method to Spanish with several differences that allow us to attain a better performance near to that of statistical methods using treebanks.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improving Prepositional Phrase Attachment Disambiguation Using the Web as Corpus

The problem of Prepositional Phrase (PP) attachment disambiguation consists in determining if a PP is part of a noun phrase, as in He sees the room with books, or an argument of a verb, as in He fills the room with books. Volk has proposed two variants of a method that queries an Internet search engine to find the most probable attachment variant. In this paper we apply the latest variant of Vo...

متن کامل

Prepositional Phrase Attachment Disambiguation using WordNet

In this thesis we use a knowledge-based approach to disambiguating prepositional phrase attachments in English sentences. This method was first introduced by S. M. Harabagiu. The Penn Treebank corpus is used as the training text. We extract 4-tuples of the form [ V P , NP1, Prep, NP2 ] and sort them into classes according to the semantic relationships between parts of each tuple. These relation...

متن کامل

Corpus Creation for New Genres: A Crowdsourced Approach to PP Attachment

This paper explores the task of building an accurate prepositional phrase attachment corpus for new genres while avoiding a large investment in terms of time and money by crowdsourcing judgments. We develop and present a system to extract prepositional phrases and their potential attachments from ungrammatical and informal sentences and pose the subsequent disambiguation tasks as multiple choic...

متن کامل

A Rule-Based Approach to Prepositional Phrase Attachment Disambiguation

I:n this paper, we describe a new corpus-based approach to prepositional phrase a t t achment disambiguation, and present results colnparing peffo> mange of this a lgori thm with other corpus-based approaches to this problem.

متن کامل

Acquiring Selectional Preferences from Untagged Text for Prepositional Phrase Attachment Disambiguation

Extracting information automatically from texts for database representation requires previously well-grouped phrases so that entities can be separated adequately. This problem is known as prepositional phrase (PP) attachment disambiguation. Current PP attachment disambiguation systems require an annotated treebank or they use an Internet connection to achieve a precision of more than 90%. Unfor...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003