Extraction Based Multi Document Summarization using Single Document Summary Cluster
نویسنده
چکیده
Multi document summarization has very great impact among research community, ever since the growth of online information and availability. Selecting most important sentences from such huge repository of data is quiet tricky and challenging task. While multi document poses some additional overhead in sentence selection, generating summaries for each individual documents and merging the sentences in a coherent order would greater strength. The proposed approach was competitively better as compared to state of MEAD summarizer at focused compression ratios. This paper focus on three different studies namely i. To find the performance of multi document summarizer from single document cluster (using MEAD) ii. Comparison of our approach with MEAD performance for the dataset considered iii. To extract sentences for multi document summarization at 30% compression rate to obtain 100% efficiency using 7-point summary sheet. Investigation carried out from an average of 22 documents shows that our system is promising.
منابع مشابه
A survey on Automatic Text Summarization
Text summarization endeavors to produce a summary version of a text, while maintaining the original ideas. The textual content on the web, in particular, is growing at an exponential rate. The ability to decipher through such massive amount of data, in order to extract the useful information, is a major undertaking and requires an automatic mechanism to aid with the extant repository of informa...
متن کاملMulti-document summarization by cluster/profile relevance and redundancy removal
We describe a sentence extraction system that produces two sorts of multi-document summaries: the first is a general-purpose summary of a cluster of related documents while the second is an entity-based summary of documents related to a particular person. The general-purpose summary is generated by a process that ranks sentences based on their document and cluster “worthiness”. The personality-...
متن کاملCentroid-based summarization of multiple documents: sentence extraction utility-based evaluation, and user studies
We present a multi-document summarizer, called MEAD, which generates summaries using cluster centroids produced by a topic detection and tracking system. We also describe two new techniques, based on sentence utility and subsumption, which we have applied to the evaluation of both single and multiple document summaries. Finally, we describe two user studies that test our models of multi-documen...
متن کاملAMDS: Sentence Extraction Based Proficient Framework For Multi-Document Summarization
Rapid improvement of electronic documents in World Wide Web has made overload to the users in accessing the information. Therefore, abstracting the primary content from numerous documents related to same topic is highly essential. Summarization of multiple documents helps in valuable decision-making in less time. This paper proposed a framework named Adept Multi-Document Summarization (AMDS) fo...
متن کاملMulti-document Summarization System: Using Fuzzy Logic and Genetic Algorithm
In the recent times, the requirement for generation of multi-document summary has gained a lot of attention among the researchers. Mostly, the text summarization technique uses the sentence extraction technique where the salient sentences in the multiple documents are extracted and presented as a summary. In our proposed system, we have developed a sentence extraction based automatic multi-docu...
متن کامل