Judging relevance through identification of lexical chains

نویسندگان

  • Steven Ngai
  • Matthew Holliman
چکیده

Lexical chaining is a method for encapsulating the meaning of a document in so-called lexical chains. The presence and concentration of such chains can be used to judge similarities both within and between documents. We present a system that performs such chaining and, using a small corpus of news articles, compare its performance with that of a naive vector-space system. We then investigate the utility of such automated similarity judgments for information retrieval tasks such as generating inter-document links and processing queries.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Document relevance calculation based on Lexical cohesion with structure analysis

This paper explores the feasibility of constructing a document relevance calculating model based on lexical cohesion with structure analysis. In this model, by extracting the semanticrelative word clusters in documents according to the lexicon cohesion principle, documents are formalized in expressions which are composed of lexicon chains with structure information. And based on this kind of re...

متن کامل

Can Automatic Abstracting Improve on Current Extracting Techniques in Aiding Users to Judge the Relevance of Pages in Search Engine Results?

Current search engines use sentence extraction techniques to produce snippet result summaries, which users may find less than ideal for determining the relevance of pages. Unlike extracting, abstracting programs analyse the context of documents and rewrite them into informative summaries. Our project aims to produce abstracting summaries which are coherent and easy to read thereby lessening use...

متن کامل

Automatic Text Summarization Using Lexical Chains: Algorithms and Experiments

Summarization is a complex task that requires understanding of the document con­ tent to determine the importance of the text. Lexical cohesion is a method to identify connected portions of the text based on the relations between the words in the text. Lexical cohesive relations can be represented using lexical chains. Lexical chains are sequences of semantically related words spread over the e...

متن کامل

Lexical Chains and Sliding Locality Windows in Content-based Text Similarity Detection

We present a system to determine content similarity of documents. Our goal is to identify pairs of book chapters that are translations of the same original chapter. Achieving this goal requires identification of not only the different topics in the documents but also of the particular flow of these topics. Our approach to content similarity evaluation employs ngrams of lexical chains and measur...

متن کامل

Linguistic Means of Description of Family Relations in the Novel “In Chancery” By J. Galsworthy

The article is devoted to the study of the evaluative component of the meaning of lexical means used to describe relations between family members in the novel “In Chancery” by J. Galsworthy. The relevance of t &he study can be attributed to the lack of works devoted to this problem. As the results of our study demonstrate, the words of the lexical-semantic group “family” were mainly used to ver...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002