Sentence-based Summarization of Scientific Documents The design and implementation of an online available automatic summarizer

نویسندگان

  • W. T. Visser
  • M. B. Wieling
چکیده

∗ In Edmundson (1969) four features are used: the title feature, the cue word feature, the location feature and the word frequency feature. The frequency method Edmundson applied used the frequency of relevant words (frequency larger than a certain threshold and not being a common word) and assigned a score to each sentence based on the frequency of the relevant words in the sentence. Because of these reasons the method of Luhn (1958) is chosen as the preferred method for calculating the frequency score of a sentence. A sentence-based automatic summarization system has been developed, which benefits from a mixture of ideas founded in the early days of automatic summarization.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Statistical Automatic Summarization in Organic Chemistry

We present an oriented numerical summarizer algorithm, applied to producing automatic summaries of scientific documents in Organic Chemistry. We present its implementation named Yachs (Yet Another Chemistry Summarizer) that combines a specific document preprocessing with a sentence scoring method relying on the statistical properties of documents. We show that Yachs achieves the best results am...

متن کامل

Biogeography-Based Optimization Algorithm for Automatic Extractive Text Summarization

    Given the increasing number of documents, sites, online sources, and the users’ desire to quickly access information, automatic textual summarization has caught the attention of many researchers in this field. Researchers have presented different methods for text summarization as well as a useful summary of those texts including relevant document sentences. This study select...

متن کامل

Quantifying the informativeness for biomedical literature summarization: An itemset mining method

OBJECTIVE Automatic text summarization tools can help users in the biomedical domain to access information efficiently from a large volume of scientific literature and other sources of text documents. In this paper, we propose a summarization method that combines itemset mining and domain knowledge to construct a concept-based model and to extract the main subtopics from an input document. Our ...

متن کامل

An Efficient Statistical Approach for Automatic Organic Chemistry Summarization

In this paper, we propose an efficient strategy for summarizing scientific documents in Organic Chemistry that concentrates on numerical treatments. We present its implementation named yachs (Yet Another Chemistry Summarizer) that combines a specific document preprocessing with a sentence scoring method relying on the statistical properties of documents. We show that yachs achieves the best res...

متن کامل

Centroid-based summarization of multiple documents

We present a multi-document summarizer, MEAD, which generates summaries using cluster centroids produced by a topic detection and tracking system. We describe two new techniques, a centroid-based summarizer, and an evaluation scheme based on sentence utility and subsumption. We have applied this evaluation to both single and multiple document summaries. Finally, we describe two user studies tha...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005