Using semantic roles to improve summaries
نویسنده
چکیده
This paper describes preliminary analysis on the influence of the semantic roles in summary generation. The proposed method involves three steps: first, the named entities in the original text are identified using a named entity recognizer; secondly, the sentences are parsed and semantic roles are extracted; thirdly, selection of the sentences containing specific semantic roles for the most relevant entities in text. Although the method is language independent, in order to check its viability, we tested the proposed approach for Romanian summaries.
منابع مشابه
ارائه یک سیستم هوشمند و معناگرا برای ارزیابی سیستم های خلاصه ساز متون
Nowadays summarizers and machine translators have attracted much attention to themselves, and many activities on making such tools have been done around the world. For Farsi like the other languages there have been efforts in this field. So evaluating such tools has a great importance. Human evaluations of machine summarization are extensive but expensive. Human evaluations can take months to f...
متن کاملUsing latent semantic analysis to grade brief summaries: some proposals
In this paper, we present several proposals in order to improve the LSA tools to evaluate brief summaries (less than 50 words) of narrative and expository texts. First, we analyse the quality of six different methods assessing essays that have been widely employed before (Foltz et al., 2000). The second objective is to analyse how new algorithms inspired by some authors (Denhière et al., 2007) ...
متن کاملGenerating Coherent Extracts of Single Documents Using Latent Semantic Analysis
Generating Coherent Extracts of Single Documents Using Latent Semantic Analysis Tristan Miller Master of Science Graduate Department of Computer Science University of Toronto 2003 A major problem with automatically-produced summaries in general, and extracts in particular, is that the output text often lacks textual coherence. Our goal is to improve the textual coherence of automatically produc...
متن کاملCapturing the semantic structure of documents using summaries in Supplemented Latent Semantic Analysis
Latent Semantic Analysis (LSA) is a mathematical technique that is used to capture the semantic structure of documents based on correlations among textual elements within them. Summaries of documents contain words that actually contribute towards the concepts of documents. In the present work, summaries are used in LSA along with supplementary information such as document category and domain in...
متن کاملMulti-Document Summaries Based on Semantic Redundancy
This paper presents a technique for producing short summaries from multiple documents. This technique promotes the belief that informative short summaries may be generated when using conceptual representations of redundant semantic information, called topic semantic signatures. The results of DUC-2002 evaluations account for the advantages of using the techniques presented in this paper.
متن کامل