Automatic Generation of Titles for a Corpus of Questions
نویسندگان
چکیده
This paper describes the followed methodology to automatically generate titles for a corpus of questions that belong to sociological opinion polls. Titles for questions have a twofold function: (1) they are the input of user searches and (2) they inform about the whole contents of the question and possible answer options. Thus, generation of titles can be considered as a case of automatic summarization. However, the fact that summarization had to be performed over very short texts together with the aforementioned quality conditions imposed on new generated titles led the authors to follow knowledge-rich and domain-dependent strategies for summarization, disregarding the more frequent extractive techniques for summarization.
منابع مشابه
Automatic Title Generation using EM
Our prototype automatic title generation system inspired by statistical machine-translation approaches [1] treats the document title like a translation of the document. Titles can be generated without extracting words from the document. A large corpus of documents with human-assigned titles is required for training title “translation” models. On an f1 evaluation score our approach outperformed ...
متن کاملAutomatic title generation for Chinese spoken documents using an adaptive k nearest-neighbor approach
The purpose of automatic title generation is to understand a document and to summarize it with only several but readable words or phrases. It is important for browsing and retrieving spoken documents, which may be automatically transcribed, but it will be much more helpful if given the titles indicating the content subjects of the documents. For title generation for Chinese language, additional...
متن کاملKnowledge Extraction for Question Titling
This article describes the work carried out over the database of questions belonging to the different opinion polls carried out over the last 50 years in Spain. Approximately half of the questions are provided with a title while the other half is untitled. It is described the work and techniques implemented in order to automatically generate the titles for the corpus of untitled questions. The ...
متن کاملAutomatic Title Generation for Spoken Broadcast News
In this paper, we implemented a set of title generation methods using training set of 21190 news stories and evaluated them on an independent test corpus of 1006 broadcast news documents, comparing the results over manual transcription to the results over automatically recognized speech. We use both F1 and the average number of correct title words in the correct order as metric. Overall, the re...
متن کاملAutomatic Question Generation from Punjabi Text with Mcq Based on Hybrid Approach
Automatic question generation is an important area of Natural Language Processing that deals with the automatic generation of questions from the given sentence or paragraph in any Indian languages like Hindi, Punjabi, Marathi, Telugu, Gujarati, Urdu, Bengali, Malayalam, Kannada etc.,. This paper is presenting the research on automatic generation of questions from the given paragraph in Punjabi ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008