Integrating Symbolic and Statistical Methods for Prepositional Phrase Attachment
نویسندگان
چکیده
This paper’ presents a novel methodology of resolving prepositional phrase attachment ambiguities. The approach consists of three phases. First, we rely on a publicly available database to classify a large corpus of prepositional attachments extracted from the Treebank parses. As a by-product, the arguments of every prepositional relation are semantically disambiguated. In the second phase, the thematic interpretation of the prepositional relations provides additional knowledge. The third phase is concerned with learning attachment decisions from word class knowledge and relation type features. The learning technique builds upon some of the most popular current statistical techniques. We have tested this methodology on (1) Wall Street Journal articles, (2) textual definitions of concepts from a dictionary and (3) an ad-hoc corpus of Web documents, used for conceptual indexing and information extraction.
منابع مشابه
A Maximum Entropy Model for Prepositional Phrase Attachment
For this example, a human annotator's attachment decision, which for our purposes is the "correct" attachment, is to the noun phrase. We present in this paper methods for constructing statistical models for computing the probability of attachment decisions. These models could be then integrated into scoring the probability of an overall parse. We present our methods in the context of prepositio...
متن کاملStatistical Models for Unsupervised Prepositional Phrase Attachment
We present several unsupervised statistical models for the prepositional phrase attachment task that approach the accuracy of the best supervised methods for this task. Our unsupervised approach uses a heuristic based on attachment proximity and trains h'om raw text that is annotated with only part-oi;speech tags and morphologicM base forms, as opposed to attachment information. It is therefore...
متن کاملHybrid Disambiguation of Prepositional Phrase Attachment and Interpretation
In this paper, a hybrid disambiguation method for the prepositional phrase (PP) attachment and interpretation problem is presented. 1 The data needed, semantic PP interpretation rules and an annotated corpus, is described first. Then the three major steps of the disambiguation method are: explained. Cross-validated evaluation results', for German (88.6-94.4% correct for binary attachment ambigu...
متن کاملImproving Prepositional Phrase Attachment Disambiguation Using the Web as Corpus
The problem of Prepositional Phrase (PP) attachment disambiguation consists in determining if a PP is part of a noun phrase, as in He sees the room with books, or an argument of a verb, as in He fills the room with books. Volk has proposed two variants of a method that queries an Internet search engine to find the most probable attachment variant. In this paper we apply the latest variant of Vo...
متن کاملIntegration of Semantic and Syntactic Constraints for Structural Noun Phrase Disambiguation
A fundamental problem in Natural Language Processing is the integration of syntactic and semantic constraints. In this paper we describe a new approach for the integration of syntactic and semantic constraints which takes advantage of a learned memory model. Our model combines localist representations for the integration of constraints and distributed representations for learning semantic const...
متن کامل