Clustering Top-Ranking Sentences for Information Access

نویسندگان

  • Anastasios Tombros
  • Joemon M. Jose
  • Ian Ruthven
چکیده

In this paper we propose the clustering of top-ranking sentences (TRS) for effective information access. Top-ranking sentences are selected by a query-biased sentence extraction model. By clustering such sentences, we aim to generate and present to users a personalised information space. We outline our approach in detail and we describe how we plan to utilise user interaction with this space for effective information access. We present an initial evaluation of TRS clustering by comparing its effectiveness at providing access to useful information to that of document clustering.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using top-ranking sentences to facilitate effective information access

Web searchers typically fail to view search results beyond the first page nor fully examine those results presented to them. In this article we describe an approach that encourages a deeper examination of the contents of the document set retrieved in response to a searcher’s query. The approach shifts the focus of perusal and interaction away from potentially uninformative document surrogates (...

متن کامل

Clustering the Information Space Using Top-Ranking Sentences: A Study of User Interaction

By considering sentences selected by a query-biased sentence extraction model from the top-retrieved documents, we create a personalised information space which is characterised by the presence of search terms. We cluster this information space, and enable searchers to interact with the resulting clusters. In order to examine whether users can recognise, and benefit from, the clustered organisa...

متن کامل

Simultaneous Ranking and Clustering of Sentences: A Reinforcement Approach to Multi-Document Summarization

Multi-document summarization aims to produce a concise summary that contains salient information from a set of source documents. In this field, sentence ranking has hitherto been the issue of most concern. Since documents often cover a number of topic themes with each theme represented by a cluster of highly related sentences, sentence clustering was recently explored in the literature in order...

متن کامل

Ranking With Cluster-Based Non-Segregated Approach to Multi-Document Categorization

To summarization of one or more document aims to create a strong summary while retaining the main characteristics of the original set of documents. To cover a number of topic with each theme represented by a cluster of highly related sentences. Sentence clustering is used, it directly generates clusters integrated with ranking. Ranking distribution for sentence in each and every cluster is diff...

متن کامل

Manifold-Ranking Based Topic-Focused Multi-Document Summarization

Topic-focused multi-document summarization aims to produce a summary biased to a given topic or user profile. This paper presents a novel extractive approach based on manifold-ranking of sentences to this summarization task. The manifold-ranking process can naturally make full use of both the relationships among all the sentences in the documents and the relationships between the given topic an...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003