Chinese Information Retrieval Based on Terms and Ontology

نویسندگان

  • Lingpeng Yang
  • Dong-Hong Ji
  • Li Tang
چکیده

In this paper, we describe our approach for single language information retrieval (SLIR) on Chinese language of NTCIR4 tasks. Firstly, we automatically extract terms (short-terms and long terms) from document set and use them to build indexes; secondly, for a query, we use short terms in the query and documents to do initial retrieval; thirdly, we build an ontology for the query to do query expansion and implement second retrieval. Finally, we use long terms to reorder the top N retrieved documents. Experiments show that the method achieves good results for both T-run and D-Run SLIR tasks of Chinese language.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Public Transport Ontology for Passenger Information Retrieval

Passenger information aims at improving the user-friendliness of public transport systems while influencing passenger route choices to satisfy transit user’s travel requirements. The integration of transit information from multiple agencies is a major challenge in implementation of multi-modal passenger information systems. The problem of information sharing is further compounded by the multi-l...

متن کامل

IAALD AFITA WCCA2008 WORLD CONFERENCE ON AGRICULTURAL INFORMATION AND IT Thesaurus and Ontology Technology for the Improvement of Agricultural Information Retrieval

We have been in a web information stage, by new information management technologies, we can get better agricultural development. The paper introduces the research work on agricultural thesaurus and ontology; it could improve the agricultural information retrieval. Main work include to convert Chinese Agricultural Thesaurus (CAT) to the agricultural ontology, this can use traditional domain know...

متن کامل

Developing a BIM-based Spatial Ontology for Semantic Querying of 3D Property Information

With the growing dominance of complex and multi-level urban structures, current cadastral systems, which are often developed based on 2D representations, are not capable of providing unambiguous spatial information about urban properties. Therefore, the concept of 3D cadastre is proposed to support 3D digital representation of land and properties and facilitate the communication of legal owners...

متن کامل

Semantic Term Based Information Retrieval Using Ontology

Information Searching and retrieval is a challenging task in the traditional keyword based textual information retrieval system. In the growing information age, adding huge data every day the searching problem also augmented. Keyword based retrieval system returns bulk of junk document irrelevant to query. To address the limitations, this paper proposed query terms along with semantic terms for...

متن کامل

توسعه هستانشناسی فرایندمحور برای فناوریهای مدیریت دانش

This paper is an attempt to develop a new ontology for knowledge management (KM) technologies, determining the relationships between these technologies and classification of them. The study applies NOY methodology. Protégé software and OWL language are used for building the ontology. The presented ontology is evaluated with abbreviation and consistency criteria and knowledge retrieval of KM tec...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004