Sentence-based Summarization of Scientific Documents The design and implementation of an online available automatic summarizer
نویسندگان
چکیده
∗ In Edmundson (1969) four features are used: the title feature, the cue word feature, the location feature and the word frequency feature. The frequency method Edmundson applied used the frequency of relevant words (frequency larger than a certain threshold and not being a common word) and assigned a score to each sentence based on the frequency of the relevant words in the sentence. Because of these reasons the method of Luhn (1958) is chosen as the preferred method for calculating the frequency score of a sentence. A sentence-based automatic summarization system has been developed, which benefits from a mixture of ideas founded in the early days of automatic summarization.
منابع مشابه
Statistical Automatic Summarization in Organic Chemistry
We present an oriented numerical summarizer algorithm, applied to producing automatic summaries of scientific documents in Organic Chemistry. We present its implementation named Yachs (Yet Another Chemistry Summarizer) that combines a specific document preprocessing with a sentence scoring method relying on the statistical properties of documents. We show that Yachs achieves the best results am...
متن کاملBiogeography-Based Optimization Algorithm for Automatic Extractive Text Summarization
Given the increasing number of documents, sites, online sources, and the users’ desire to quickly access information, automatic textual summarization has caught the attention of many researchers in this field. Researchers have presented different methods for text summarization as well as a useful summary of those texts including relevant document sentences. This study select...
متن کاملQuantifying the informativeness for biomedical literature summarization: An itemset mining method
OBJECTIVE Automatic text summarization tools can help users in the biomedical domain to access information efficiently from a large volume of scientific literature and other sources of text documents. In this paper, we propose a summarization method that combines itemset mining and domain knowledge to construct a concept-based model and to extract the main subtopics from an input document. Our ...
متن کاملAn Efficient Statistical Approach for Automatic Organic Chemistry Summarization
In this paper, we propose an efficient strategy for summarizing scientific documents in Organic Chemistry that concentrates on numerical treatments. We present its implementation named yachs (Yet Another Chemistry Summarizer) that combines a specific document preprocessing with a sentence scoring method relying on the statistical properties of documents. We show that yachs achieves the best res...
متن کاملCentroid-based summarization of multiple documents
We present a multi-document summarizer, MEAD, which generates summaries using cluster centroids produced by a topic detection and tracking system. We describe two new techniques, a centroid-based summarizer, and an evaluation scheme based on sentence utility and subsumption. We have applied this evaluation to both single and multiple document summaries. Finally, we describe two user studies tha...
متن کامل