Resolving Ambiguous Preposition Phrase Using Genetic Algorithm

Authors

  • Jimmy
  • Simarjeet Kaur
Abstract

Text mining is the process of discovering interesting, non-trivial patterns or knowledge embedded in unstructured text documents from a fixed domain. It is also known as knowledge discovery from text databases. Text mining tasks include text categorization, text clustering, concept/entity extraction, document summarization, and entity relation modelling. Extracting concepts and facts from text is the first step in text mining. The main problem in extracting them from natural-language text is that some words, phrases, or sentences are ambiguous. The ambiguity may be lexical, semantic, or syntactic. Several existing techniques for resolving this problem have been implemented and evaluated on a set of test cases, and applying them yields better results. This paper discusses these techniques.

Similar resources

Statistical Models for Unsupervised Prepositional Phrase Attachment

We present several unsupervised statistical models for the prepositional phrase attachment task that approach the accuracy of the best supervised methods for this task. Our unsupervised approach uses a heuristic based on attachment proximity and trains from raw text that is annotated with only part-of-speech tags and morphological base forms, as opposed to attachment information. It is therefor...

A “Random Walk” through Prepositional Phrase Attachment

This document surveys the field of research in preposition phrase attachment resolution, looking back at the way the problem was approached in the 1990s, as well as at modern techniques for resolving this basic syntactic ambiguity problem. Techniques are described and compared, and their effectiveness is considered.

The Use of Relative Duration in Syntactic Disambiguation

We describe the modification of a grammar to take advantage of prosodic information automatically extracted from speech. The work includes (1) the development of an integer "break index" representation of prosodic phrase boundary information, (2) the automatic detection of prosodic phrase breaks using a hidden Markov model on relative duration of phonetic segments, and (3) the integration of th...

Attaching Multiple Prepositional Phrases: Generalized Backed-off Estimation

There has recently been considerable interest in the use of lexically-based statistical techniques to resolve prepositional phrase attachments. To our knowledge, however, these investigations have only considered the problem of attaching the first PP, i.e., in a [V NP PP] configuration. In this paper, we consider one technique which has been successfully applied to this problem, backed-off estima...

Probabilistic Parse Scoring Based on Prosodic Phrasing

The relative size and location of prosodic phrase boundaries provides an important cue for resolving syntactic ambiguity. In previous work, we have introduced an analysis/synthesis formalism for scoring parses in terms of the similarity between prosodic patterns recognized from a given utterance and synthesized for the hypothesized parse. This paper describes a new approach to the synthesis pro...

Publication date: 2014