Ad Hoc Retrieval Experiments Using WordNet and Automatically Constructed Thesauri
Authors
Abstract
This paper describes our method for the automatic ad hoc task of TREC-7. We propose a method to improve the performance of an information retrieval system by expanding the query using three different types of thesaurus. The expansion terms are taken from a handcrafted thesaurus (WordNet), a co-occurrence-based automatically constructed thesaurus, and a syntactic predicate-argument-based automatically constructed thesaurus.
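A minimal sketch of query expansion over several thesauri may help make the idea concrete. The abstract does not specify how expansion terms are scored or combined, so the summed-similarity scoring, the `expand_query` function, the `top_k` parameter, and the toy thesaurus dictionaries below are all illustrative assumptions rather than the authors' method.

```python
from collections import defaultdict

# Toy stand-ins for the three thesaurus types (hypothetical data, not from the paper).
# Each maps a term to related terms with an assumed similarity value.
WORDNET_LIKE = {"car": {"automobile": 1.0, "vehicle": 0.5}}
COOCCURRENCE = {"car": {"engine": 0.6, "automobile": 0.4}, "rental": {"lease": 0.7}}
PREDICATE_ARGUMENT = {"car": {"truck": 0.5}, "rental": {"hire": 0.6, "lease": 0.3}}


def expand_query(query_terms, thesauri, top_k=3):
    """Return the original terms plus the top_k highest-scoring expansion terms.

    Assumed scoring: a candidate's score is the sum of its similarities to
    every query term across every thesaurus.
    """
    scores = defaultdict(float)
    for thesaurus in thesauri:
        for term in query_terms:
            for related, similarity in thesaurus.get(term, {}).items():
                if related not in query_terms:
                    scores[related] += similarity
    # Sort by descending score, breaking ties alphabetically for determinism.
    ranked = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    return list(query_terms) + [term for term, _ in ranked[:top_k]]


expanded = expand_query(
    ["car", "rental"], [WORDNET_LIKE, COOCCURRENCE, PREDICATE_ARGUMENT]
)
print(expanded)
```

In this sketch, a term suggested by more than one thesaurus (here "automobile" and "lease") accumulates a higher score than a term suggested by only one, which is one simple way to let the sources reinforce each other.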
Related resources
Combining General Hand-Made and Automatically Constructed Thesauri for Query Expansion in Information Retrieval
One of the most intuitive ideas for enhancing the effectiveness of an information retrieval system is to include the use of a thesaurus. WordNet, as a hand-crafted and general-purpose thesaurus, intuitively should also work fine in information retrieval, but unfortunately, experimental results by many researchers have not been promising. Thereby in this paper we investigate why the use of WordN...
An Association Thesaurus for Information Retrieval
Although commonly used in both commercial and experimental information retrieval systems, thesauri have not demonstrated consistent benefits for retrieval performance, and it is difficult to construct a thesaurus automatically for large text databases. In this paper, an approach, called PhraseFinder, is proposed to construct collection-dependent association thesauri automatically using large full-...
Complementing WordNet with Roget's and Corpus-based Thesauri for Information Retrieval
This paper proposes a method to overcome the drawbacks of WordNet when applied to information retrieval by complementing it with Roget's thesaurus and corpus-derived thesauri. Words and relations that are not included in WordNet can be found in the corpus-derived thesauri. The effects of polysemy can be minimized with a weighting method that considers all query terms and all of the thesauri. Experimen...
A Two-Stage Retrieval Model for the TREC-7 Ad Hoc Task
A two-stage model for ad hoc text retrieval is proposed in which recall and precision are maximized sequentially. The first stage employs query expansion methods using WordNet and a modified stemming algorithm. The second stage incorporates a term proximity-based scoring function and a prototype-based reranking method. The effectiveness of the two-stage retrieval model is tested on the TREC-7 ad...
Focused Search in Books and Wikipedia: Categories, Links and Relevance Feedback
In this paper we describe our participation in INEX 2009 in the Ad Hoc Track, the Book Track, and the Entity Ranking Track. In the Ad Hoc track we investigate focused link evidence, using only links from retrieved sections. The new collection is not only annotated with Wikipedia categories, but also with YAGO/WordNet categories. We explore how we can use both types of category information, in t...