Monologue Summarization: Extraction of Important Sentences for TV News Commentary Programs
Abstract
The extraction of important sentences is a key technique for automatic summarization. Whereas most research in this area has targeted written language, we are conducting research on spoken language monologues such as presentations and TV news commentary programs. We collected 50 TV news commentary programs, and experimented with the extraction of important sentences from transcriptions. We used two extraction methods. The first one uses word statistics, and the second one uses the surface features of the sentences. In order to use the latter method, we analyzed the transcriptions and obtained surface features related to the importance of the sentences. The experiments showed that the latter method was better than the former one especially when extracting small sets of sentences. We also mention the ambiguity of judgment by individuals and the contribution of each surface feature to the importance of the sentences.
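As a rough illustration of the first method mentioned in the abstract (extraction based on word statistics), the following is a minimal sketch only. The abstract does not say which statistics were used, so the scoring here (average corpus frequency of a sentence's non-stopword tokens) is an assumption, and the names `split_sentences`, `tokenize`, and `extract_by_word_statistics` are hypothetical.

```python
import re
from collections import Counter


def split_sentences(text):
    # Naive splitter: break after ., ! or ? followed by whitespace.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenize(sentence):
    # Lowercase alphabetic tokens (apostrophes kept inside words).
    return re.findall(r"[a-z']+", sentence.lower())


def extract_by_word_statistics(text, k=2, stopwords=frozenset()):
    """Return the k highest-scoring sentences in their original order.

    Assumption: a sentence's score is the mean document frequency of its
    non-stopword tokens. This is one simple choice, not the paper's method.
    """
    sentences = split_sentences(text)
    tokens = [[w for w in tokenize(s) if w not in stopwords] for s in sentences]
    freq = Counter(w for toks in tokens for w in toks)
    scores = [
        sum(freq[w] for w in toks) / len(toks) if toks else 0.0
        for toks in tokens
    ]
    # Rank by score (descending); ties go to the earlier sentence.
    top = sorted(range(len(sentences)), key=lambda i: (-scores[i], i))[:k]
    return [sentences[i] for i in sorted(top)]


if __name__ == "__main__":
    sample = (
        "The budget vote is today. Weather was nice. "
        "The budget vote affects the budget deficit."
    )
    print(extract_by_word_statistics(sample, k=2, stopwords={"the", "is", "was"}))
```

The second method in the abstract, based on surface features of sentences, would replace the frequency score with features derived from the transcriptions. Those features are not listed in the abstract, so they are not sketched here.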
Related resources
Speech to speech translation system f approach
This paper describes ongoing research on a Japanese-to-English speech-to-speech translation system for "controlled monologue", such as TV news and commentary programs in which the speaking styles are controlled as a monologue. We have adopted the data-driven approach since the TV programs in question cover a wide range of topics, and because it seems much too labor intensive to handcraft transl...
Comparison of Different Machine Learning Methods for Extractive Persian Speech-to-Speech Summarization Without Using Transcripts
In this paper, extractive speech summarization using different machine learning algorithms was investigated. The task of speech summarization is to extract important and salient segments from speech so that speech files can be accessed, searched, and browsed more easily and at lower cost. In this paper, a new method for speech summarization without using automatic speech recognitio...
EXTRACTION-BASED TEXT SUMMARIZATION USING FUZZY ANALYSIS
Due to the explosive growth of the world-wide web, automatic text summarization has become an essential tool for web users. In this paper we present a novel approach for creating text summaries. Using fuzzy logic and WordNet, our model extracts the most relevant sentences from an original document. The approach utilizes fuzzy measures and inference on the extracted textual information from the docu...
Key Phrase Extraction of Lightly Filtered Broadcast News
This paper explores the impact of light filtering on automatic key phrase extraction (AKE) applied to Broadcast News (BN). Key phrases are words and expressions that best characterize the content of a document. Key phrases are often used to index the document or as features in further processing. This makes improvements in AKE accuracy particularly important. We hypothesized that filtering out ...
Towards Constructing Sports News from Live Text Commentary
In this paper, we investigate the possibility of automatically generating sports news from live text commentary scripts. As a preliminary study, we treat this task as a special kind of document summarization based on sentence extraction. We formulate the task in a supervised learning-to-rank framework, utilizing both traditional sentence features for generic document summarization and novelly des...