Topic Models for Summarizing Novelty

نویسندگان

  • James Allan
  • Rahul Gupta
  • Vikas Khandelwal
چکیده

We define temporal summaries of news stories as extracting as few sentences as possible from each event within a news topic, where the stories are presented one at a time and sentences from a story must be ranked before the next story can be considered. We outline an evaluation strategy that we have developed for this task and describe simple language models for capturing novelty and usefulness in the context of summarization. We show that the simple approaches work moderately well, and outline our ideas for moving forward.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

اولویت‌بندی معیارهای انتخاب موضوع پایان‌نامه با روش تحلیل سلسله مراتبی (AHP) از دیدگاه دانشجویان دکتری

Background and Aim: Choosing thesis topic is one of the most important decisions of postgraduate students and many factors affect such decision. This study aimed to prioritize the criteria for choosing thesis topic from Ph.D. students’ viewpoint, using the analytic hierarchy process (AHP) and ranking methods. Materials and Methods: This analytical study was carried out on the School of Public ...

متن کامل

DualSum: a Topic-Model based approach for update summarization

Update summarization is a new challenge in multi-document summarization focusing on summarizing a set of recent documents relatively to another set of earlier documents. We present an unsupervised probabilistic approach to model novelty in a document collection and apply it to the generation of update summaries. The new model, called DUALSUM, results in the second or third position in terms of ...

متن کامل

NLP Driven Models for Automatically Generating Survey Articles for Scientific Topics

This thesis presents new methods that use natural language processing (NLP) driven models for summarizing research in scientific fields. Given a topic query in the form of a text string, we present methods for finding research articles relevant to the topic as well as summarization algorithms that use lexical and discourse information present in the text of these articles to generate coherent a...

متن کامل

TREC 2003 Novelty and Web Track at ICT

In this paper, we will present our approaches and experiments on the following two tracks of TREC-2003: Novelty track and Web track. The novelty track can be treated as a binary classification problem: relevant sentences vs. irrelevant sentences, or new vs. non-new. In this way, we applied variants of techniques that have been employed for text categorization problem. To retrieve the relevant s...

متن کامل

Novelty and Beyond: Towards Combined Motivation Models and Integrated Learning Architectures

For future intrinsically motivated agents to combine multiple intrinsic motivation or behavioural components, there is a need to identify fundamental units of motivation models that can be reused and combined to produce more complex agents. This chapter reviews three existing models of intrinsic motivation, novelty, interest and competence-seeking motivation, that are based on the neural networ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001