Machine Learning Approach to Multi-Document Summarization.
نویسندگان
چکیده
منابع مشابه
An Approach for Concept-based Automatic Multi- Document Summarization using Machine Learning
Text Summarization is compressing the source text into a shorter version preserving its information content and overall meaning. It is very complicated for human beings to manually summarize large documents of text. Text summarization plays an important role in the area of natural language processing and text mining. Many approaches use statistics and machine learning techniques to extract sent...
متن کاملApproach of Topical Multi-document Summarization
This paper describes an innovative algorithm that aims to solve the topical multidocument summarization problems. Given a user input topical query and a large unlabeled document collections, the algorithm first clustering the documents into a set of clusters by using the efficient spherical k-means algorithm and ranks each document and the cluster on the basis of the approximation to the topic....
متن کاملA Graph-based Approach to Cross-language Multi-document Summarization
Cross-language summarization is the task of generating a summary in a language different from the language of the source documents. In this paper, we propose a graph-based approach to multi-document summarization that integrates machine translation quality scores in the sentence extraction process. We evaluate our method on a manually translated subset of the DUC 2004 evaluation campaign. Resul...
متن کاملA SVM-Based Ensemble Approach to Multi-Document Summarization
In this paper, we present a Support Vector Machine (SVM) based ensemble approach to combat the extractive multi-document summarization problem. Although SVM can have a good generalization ability, it may experience a performance degradation through wrong classifications. We use a committee of several SVMs, i.e. Cross-Validation Committees (CVC), to form an ensemble of classifiers where the stra...
متن کاملA Sentence-Trimming Approach to Multi-Document Summarization
We implemented an initial application of a sentence-trimming approach (Trimmer) to the problem of multi-document summarization in the MSE2005 and DUC2005 tasks. Sentence trimming was incorporated into a feature-based summarization system, called MultiDocument Trimmer (MDT), by using sentence trimming as both a preprocessing stage and a feature for sentence ranking. We demonstrate that we were a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Natural Language Processing
سال: 2003
ISSN: 1340-7619,2185-8314
DOI: 10.5715/jnlp.10.81