Using Arabic Wordnet for semantic indexation in information retrieval system
نویسندگان
چکیده
In the context of arabic Information Retrieval Systems (IRS) guided by arabic ontology and to enable those systems to better respond to user requirements, this paper aims to representing documents and queries by the best concepts extracted from Arabic Wordnet. Identified concepts belonging to Arabic WordNet synsets are extracted from documents and queries, and those having a single sense are expanded. The expanded query is then used by the IRS to retrieve the relevant documents searched. Our experiments are based primarily on a medium size corpus of arabic text. The results obtained shown us that there are a global improvement in the performance of the arabic IRS.
منابع مشابه
Automatic Construction of Persian ICT WordNet using Princeton WordNet
WordNet is a large lexical database of English language, in which, nouns, verbs, adjectives, and adverbs are grouped into sets of cognitive synonyms (synsets). Each synset expresses a distinct concept. Synsets are interlinked by both semantic and lexical relations. WordNet is essentially used for word sense disambiguation, information retrieval, and text translation. In this paper, we propose s...
متن کاملA concept-based approach for indexing documents in IR
This paper addresses two important problems related to the use of semantics in IR. The first one concerns the representation of document semantics and its proper use in retrieval. The second is the integration of semantic-based retrieval with "traditional" keywords-based retrieval. The proposed approach aims to represent the document content by the best semantic network called document semantic...
متن کاملA Cross-language Information Retrieval Based on an Arabic Ontology in the Legal Domain
In this paper, we describe a web-based multilingual tool for Arabic information retrieval based on ontology in the legal domain. We illustrate the manual construction of the ontology and the way it is edited using Protégé2000. Using Arabic (UN) documents we identify the legal terms and the semantic relations between them before mapping them onto their position in the ontology. The process of se...
متن کاملSemiautomatic Image Retrieval Using the High Level Semantic Labels
Content-based image retrieval and text-based image retrieval are two fundamental approaches in the field of image retrieval. The challenges related to each of these approaches, guide the researchers to use combining approaches and semi-automatic retrieval using the user interaction in the retrieval cycle. Hence, in this paper, an image retrieval system is introduced that provided two kind of qu...
متن کاملInformation Retrieval: Applications to English and Arabic Documents
Arabic information retrieval has become a focus of research and commercial development due to the vital necessity of such tools for people in the electronic age. The number of Arabicspeaking Internet users is assumed to achieve 43 millions during this year; however, on the other side, few full search engines are available to Arabic-speaking users. This dissertation focuses on three naturally re...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1306.2499 شماره
صفحات -
تاریخ انتشار 2013