Thesaurus Based on Grey Theory for Interactive Query Expansion

نویسندگان

  • Hahn-Ming Lee
  • Chi-Chun Huang
  • Yong-Cheng Chen
چکیده

With the increasing availability of information on the WWW (World Wide Web), it becomes more important and feasible to retrieve information efficiently. Query expansion is the process of supplementing an original query with additional terms in order to refine a search and increase retrieval effectiveness. If the query expansion is interactive, then the user and the system work together to expand the query. A well-designed thesaurus not only can represent the semantic knowledge among terms, but also can help to improve the retrieval performance in search engine. In previous researches, many thesauruses are represented as symmetric type, and this lead to some unreasonable results. Therefore, in this study, we propose an interactive searching scheme that aims to provide users an easy way to retrieve their desired information. A non-symmetric association thesaurus, which is generated by Sparse Grey Relational Analysis (SGRA) algorithm we proposed, is involved in query expansion process. Also, the SGRA algorithm improves the problem that traditional grey relational analysis cannot deal with sparse matrix, and to reduce the noise term. Experiment show that the searching scheme of expanding query by association thesaurus has improved retrieval performance slightly. We also find that the correctness of word segmentation and the completeness of dataset have a great impact on the quality of association thesaurus.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Query Architecture Expansion in Web Using Fuzzy Multi Domain Ontology

Due to the increasing web, there are many challenges to establish a general framework for data mining and retrieving structured data from the Web. Creating an ontology is a step towards solving this problem. The ontology raises the main entity and the concept of any data in data mining. In this paper, we tried to propose a method for applying the "meaning" of the search system, But the problem ...

متن کامل

Association Thesaurus Construction for Interactive Query Expansion Based on Association Rule Mining

This paper presents an interactive query expansion method with association thesaurus, which is mined from the ‘selected web pages’ of users in the query logs. The ‘selected web pages’ of users are transferred into ‘sets of query terms’ and then used for term correlation mining. Accordingly, various association thesauruses concerning different query terms are constructed from these term correlat...

متن کامل

Regression Model and Query Expansion for NTCIR-2 Ad Hoc Retrieval Task

This paper describes procedures and results in a monolingual retrieval experiment using NTCIR-2 test collection. First, we discuss a simplified logistic regression model, which enable us to adjust the regression model for working well in each of various document databases. To do automatically the adjustment, a method for estimating parameters in the regression model, is developed based on a kin...

متن کامل

Fuzzy Rough Set Based Web Query Expansion

Fuzzy rough set theory is a candidate framework for query refinement. Indeed, a thesaurus defines an approximation space in which the query, which is a set of terms, can be approximated from the upper and the lower side. The upper approximation turns out to be too flexible however, resulting in query explosion, while the lower approximation is too strict, resulting in the empty query. Therefore...

متن کامل

The Exploration and Analysis of Using Multiple Thesaurus Types for Query Expansion in Information Retrieval

This paper proposes the use of multiple thesaurus types for query expansion in information retrieval. Hand-crafted thesaurus, corpus-based co-occurrence-based thesaurus and syntactic-relation-based thesaurus are combined and used as a tool for query expansion. A simple word sense disambiguation is performed to avoid misleading expansion terms. Experiments using TREC-7 collection proved that thi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002