Butcher, baker, or candlestick maker? Predicting occupations using predicate-argument relations
نویسندگان
چکیده
In a previous question answering study, we identified nine semantic-relationship types, including synonyms, hypernyms, word chains, and holonyms, that exist between terms inText Retrieval Conference queries and those in their supporting sentences in the Advanced Question Answering for Intelligence (Graff, 2002) corpus. The most frequently occurring relationship type was the hypernym (e.g.,KatherineHepburn is an actress).The aim of the present work, therefore, was to develop a method for determining a person’s occupation from syntactic data in a text corpus. First, in the P -System, we compared predicate–argument data involving a proper name for different occupations using Okapi’s BM25 weighting algorithm.When classifying actors and using sufficiently frequent names, an accuracy of 0.955 was attained. For evaluation purposes, we also implemented a standard apposition-based classifier (A-System). This performs well, but only if a particular name happens to appear in apposition with the corresponding occupation. Last, we created a hybrid (H -System) which combines the strengths of P with those of A. Using data with a minimum of 100 predicate–argument pairs,H performed best with an overall lenient accuracy of 0.750 while A and P scored 0.615 and 0.656, respectively. We therefore conclude that a hybrid approach combining information from different sources is the best way to predict occupations.
منابع مشابه
Home care safety markers: a scoping review.
Safety in home care is a new research frontier, and one in which demand for services continues to rise. A scoping review of the home care literature on chronic obstructive pulmonary disease and congestive heart failure was thus completed to identify safety markers that could serve to develop our understanding of safety in this sector. Results generated seven safety markers: (a) Home alone; (b) ...
متن کاملبرچسبزنی نقش معنایی جملات فارسی با رویکرد یادگیری مبتنی بر حافظه
Abstract Extracting semantic roles is one of the major steps in representing text meaning. It refers to finding the semantic relations between a predicate and syntactic constituents in a sentence. In this paper we present a semantic role labeling system for Persian, using memory-based learning model and standard features. Our proposed system implements a two-phase architecture to first identify...
متن کاملUtilizing Automatic Predicate-Argument Analysis for Concept Map Mining
Concept maps can be used to provide concise and structured summaries of documents. Motivated by their usefulness in many application scenarios, several approaches have been suggested for concept map mining, the automatic extraction of concept maps from text. However, a major bottleneck of previous work is the common pattern-based approach used to extract concepts and relations from documents wh...
متن کاملConnotation Frames: Typed Relations of Implied Sentiment in Predicate-Argument Structure
Through a choice of a predicate (e.g., “violate”), a writer can convey subtle sentiments and value judgements toward the arguments of a verb (e.g., projecting the agent as an “antagonist” and the theme as a “victim”). We introduce connotation frames to encode the rich dimensions of implied sentiment, value judgements, and effect evaluation as typed relations that these choices influence, and pr...
متن کاملUnsupervised Discovery of Significant Candlestick Patterns for Forecasting Security Price Movements
Candlestick charts are a visually appealing method of presenting price movements of securities. It has been developed in Japan centuries ago. The depiction of movements as candlesticks tends to exhibit recognizable patterns that allow for predicting future price movements. Common approaches of employing candlestick analysis in automatic systems rely on a manual a-priori specification of well-kn...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- JASIST
دوره 62 شماره
صفحات -
تاریخ انتشار 2011