Fuzzy clustering for indexing in the GAMBAL information retrieval system

Authors

  • Vicenç Torra
  • Sergi Lanau
  • Sadaaki Miyamoto
Abstract

Gambal is an information retrieval system for indexing and accessing web pages. It includes graphical interfaces that ease searching for and accessing web pages. In particular, the interfaces provide the user with tools for navigating through hierarchies of documents and for visualizing selected documents and similar ones. Here, similarity is based on either WordNet 1.7 or Latent Semantic Analysis. The graphical interfaces include both Hierarchical Spherical Clustering (HSC) and Hierarchical Self-Organizing Maps (HSOM). In this work we introduce the use of fuzzy clustering for indexing in the HSC interface.
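
The abstract does not say which fuzzy clustering algorithm is used, so the following is only a minimal sketch of the general idea, assuming standard fuzzy c-means over small numeric document vectors. The function name `fuzzy_c_means`, its parameters, and the toy data are all hypothetical and are not taken from the Gambal system. The point it illustrates is that each document receives a graded membership in every cluster rather than a single hard assignment.

```python
import math
import random


def fuzzy_c_means(points, c, m=2.0, iters=100, tol=1e-6, seed=0):
    """Standard fuzzy c-means (an assumed stand-in, not Gambal's own code).

    Returns (centers, memberships); memberships[i][j] is the degree to
    which point i belongs to cluster j, and each row sums to 1.
    """
    rng = random.Random(seed)
    n, dim = len(points), len(points[0])

    # Random initial memberships, normalised so each row sums to 1.
    u = []
    for _ in range(n):
        row = [rng.random() + 1e-9 for _ in range(c)]
        total = sum(row)
        u.append([v / total for v in row])

    centers = []
    for _ in range(iters):
        # Each center is the mean of all points weighted by membership^m.
        centers = []
        for j in range(c):
            w = [u[i][j] ** m for i in range(n)]
            total = sum(w)
            centers.append(
                [sum(w[i] * points[i][d] for i in range(n)) / total
                 for d in range(dim)]
            )

        # Memberships are proportional to distance^(-2 / (m - 1)).
        shift = 0.0
        for i in range(n):
            dists = [math.dist(points[i], centers[j]) for j in range(c)]
            if min(dists) == 0.0:
                # The point coincides with one or more centers.
                new = [1.0 if d == 0.0 else 0.0 for d in dists]
            else:
                new = [d ** (-2.0 / (m - 1.0)) for d in dists]
            total = sum(new)
            new = [v / total for v in new]
            shift = max(shift, max(abs(a - b) for a, b in zip(new, u[i])))
            u[i] = new

        if shift < tol:
            break

    return centers, u


# Hypothetical 2-D "document vectors": two tight groups and one point between.
points = [(0.0, 0.1), (0.1, 0.0), (0.2, 0.1),
          (1.0, 0.9), (0.9, 1.0), (1.1, 1.0),
          (0.55, 0.5)]
centers, u = fuzzy_c_means(points, c=2)
for p, row in zip(points, u):
    print(p, [round(v, 3) for v in row])
```

In an indexing setting, graded memberships like these would let a document that sits between two topics be reachable from both clusters. The in-between point in the toy data is expected to receive a more evenly split membership than the points inside either group.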

Similar resources

Exploration of textual document archives using a fuzzy hierarchical clustering algorithm in the GAMBAL system

The Internet, together with the large amount of textual information available in document archives, has increased the relevance of information retrieval tools. In this work we present an extension of the Gambal system for clustering and visualization of documents based on fuzzy clustering techniques. The tool allows the set of documents to be structured in a hierarchical way (using a fuzzy ...

Fuzzy Clustering Method for Content-based Indexing

Efficient and accurate information retrieval is one of the main issues in multimedia databases. In content-based multimedia retrieval databases, contents or features of the database objects are used for retrieval. To retrieve similar database objects, we often perform a nearest-neighbor search. A nearest-neighbor search is used to retrieve similar database objects with features nearest to the que...

Fuzzy C-Means Clustering for Biomedical Documents Using Ontology Based Indexing and Semantic Annotation

Search is the most obvious application of information retrieval. The variety of widely obtainable biomedical data is enormous and is expanding fast. Because of this expansion, existing techniques are no longer sufficient to extract the most interesting patterns from the collection according to the user's requirements. Recent research concentrates more on semantic-based searching than on the traditional term bas...

Using Natural Clusters Information to Build Fuzzy Indexing Structure

Efficient and accurate information retrieval is one of the main issues in multimedia databases. The key to this, however, is how to build an efficient indexing structure. In this paper, we demonstrate how to use a fuzzy clustering algorithm, Sequential Fuzzy Competitive Clustering (SFCC), to obtain natural cluster information from the data. We then use this information to build an efficient index...

Fuzzy Ontology and Information Access on the Web

The Web is the largest available repository of data. This contribution presents an application of fuzzy set theory techniques to the definition of flexible systems for locating and accessing information on the Web. Our research is also motivated by the fact that there are various ways for users to access the large amount of available and mostly unknown information. Clustering methods are a...

Publication date: 2003