A Multilingual Semantic Search Assistant to Facilitate Information Retrieval

نویسندگان

  • Soonho Kim
  • Jeetendra Singh
  • Ze Li
  • Andrew Bagdanov
  • Ajay Krishnan
  • Margherita Sini
  • Johannes Keizer
  • T. V. Prabhakar
  • Zhong Wang
چکیده

This paper describes a multilingual semantic search assistant (MSSA) that could help users find proper keywords in FAO fulltext-based search engines. The MSSA implements four independent functions each associated with the AGROVOC multilingual thesaurus to assist users in choosing better keywords: a Venn diagram-based Boolean search interface, an animated AGROVOC concept browser, cross-language query expansion for five official FAO languages, and domain-specific synonym expansion. A usability study of cross-language support and domain-specific synonym expansion was performed through a comparative evaluation of precision and recall in the WAICENT Information Finder. The cross-language support functionality yielded a significant increase in recall without any decrease in precision when compared to a simple search in. Domain-specific synonym expansion, on the other hand, made no significant difference in precision and recall. We suggest some ways to improve the performance of MSSA. Keyword: AGROVOC, multilingual semantic search, search assistant, multilingual, semantic, information retrieval, Food and Agriculture Organization of the United Nation, WAICENT Information Finder, Web Services, Boolean search, topic browser, cross-language query expansion, synonym expansion

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Cross-Language Hybrid Keyword and Semantic Search

The growth of multilingual web content and increasing internationalization portends the need for cross-language information retrieval. As a solution to this problem for narrow-domain, data-rich web content, we offer ML-HyKSS: MultiLingual Hybrid Keyword and Semantic Search. The key component of ML-HyKSS is a collection of linguistically grounded conceptual-model instances called extraction onto...

متن کامل

Cross-Lingual Information Retrieval and Semantic Interoperability for Cultural Heritage Repositories

This paper describes a computational linguistics-based approach for providing interoperability between multi-lingual systems in order to overcome crucial issues like cross-language and cross-collection retrieval. Our proposal is a system which improves capabilities of language-technology-based information extraction. In the last few years various theories have been developed and applied for mak...

متن کامل

Supporting Multilingual Internet Searching and Browsing

The amount of non-English information has proliferated rapidly in recent years. The broad diversity of the multilingual content presents a substantial research challenge in the field of knowledge discovery and information retrieval. Therefore there is an increased interest in the development of multilingual systems to support information sharing across languages. The goal of this dissertation i...

متن کامل

Improving Knowledge Representation to Speed up the Generation of Grammars for a Multilingual Web Assistant

This paper describes the use of a syntactico-semantic taxonomy to facilitate the generation of grammars for a multilingual web assistant. In particular, it describes the generation of grammars for two different domains: cultural events and medical specialists.

متن کامل

Multi-word processing in an ontology-based Cross-Language Information Retrieval model for specific domain collections

This paper proposes a methodological approach to CLIR applications for the development of a system which improves multi-word processing when specific domain translation is required. The system is based on a multilingual ontology, which can improve both translation and retrieval accuracy and effectiveness. The proposed framework allows mapping data and metadata among language-specific ontologies...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008