Using Arabic Wordnet for semantic indexation in information retrieval system

نویسندگان

  • Mohammed Alaeddine Abderrahim
  • Mohammed El Amine Abderrahim
  • Amine Chikh
چکیده

In the context of arabic Information Retrieval Systems (IRS) guided by arabic ontology and to enable those systems to better respond to user requirements, this paper aims to representing documents and queries by the best concepts extracted from Arabic Wordnet. Identified concepts belonging to Arabic WordNet synsets are extracted from documents and queries, and those having a single sense are expanded. The expanded query is then used by the IRS to retrieve the relevant documents searched. Our experiments are based primarily on a medium size corpus of arabic text. The results obtained shown us that there are a global improvement in the performance of the arabic IRS.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Construction of Persian ICT WordNet using Princeton WordNet

WordNet is a large lexical database of English language, in which, nouns, verbs, adjectives, and adverbs are grouped into sets of cognitive synonyms (synsets). Each synset expresses a distinct concept. Synsets are interlinked by both semantic and lexical relations. WordNet is essentially used for word sense disambiguation, information retrieval, and text translation. In this paper, we propose s...

متن کامل

A concept-based approach for indexing documents in IR

This paper addresses two important problems related to the use of semantics in IR. The first one concerns the representation of document semantics and its proper use in retrieval. The second is the integration of semantic-based retrieval with "traditional" keywords-based retrieval. The proposed approach aims to represent the document content by the best semantic network called document semantic...

متن کامل

A Cross-language Information Retrieval Based on an Arabic Ontology in the Legal Domain

In this paper, we describe a web-based multilingual tool for Arabic information retrieval based on ontology in the legal domain. We illustrate the manual construction of the ontology and the way it is edited using Protégé2000. Using Arabic (UN) documents we identify the legal terms and the semantic relations between them before mapping them onto their position in the ontology. The process of se...

متن کامل

Semiautomatic Image Retrieval Using the High Level Semantic Labels

Content-based image retrieval and text-based image retrieval are two fundamental approaches in the field of image retrieval. The challenges related to each of these approaches, guide the researchers to use combining approaches and semi-automatic retrieval using the user interaction in the retrieval cycle. Hence, in this paper, an image retrieval system is introduced that provided two kind of qu...

متن کامل

Information Retrieval: Applications to English and Arabic Documents

Arabic information retrieval has become a focus of research and commercial development due to the vital necessity of such tools for people in the electronic age. The number of Arabicspeaking Internet users is assumed to achieve 43 millions during this year; however, on the other side, few full search engines are available to Arabic-speaking users. This dissertation focuses on three naturally re...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1306.2499  شماره 

صفحات  -

تاریخ انتشار 2013