Proposals for a normalized representation of Standard Arabic full form lexica

Authors

  • Susanne Salmon-Alt
  • Amine Akrout
  • Laurent Romary
Abstract

Standardized lexical resources are an important prerequisite for the development of robust, wide-coverage natural language processing applications. We therefore applied the Lexical Markup Framework (LMF), a recent ISO initiative towards standards for designing, implementing and representing lexical resources, to a test bed of data for an Arabic full form lexicon. Apart from minor structural accommodations needed to take into account the traditional root-based organization of Arabic dictionaries, the LMF proposal proved suitable for our purpose, especially because it manages the hierarchical data structure (the LMF core model) separately from the elementary linguistic descriptors (data categories).
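The separation described in the abstract can be illustrated with a minimal sketch. It is not the authors' model or the LMF schema: the class names loosely follow LMF's Lexical Entry and Word Form, while the `DATA_CATEGORIES` registry, the field names, the `root` attribute and the transliterated sample entries are assumptions chosen for illustration.

```python
from collections import defaultdict
from dataclasses import dataclass, field

# Hypothetical data category registry: descriptor name -> allowed values.
# It is kept apart from the structural classes below.
DATA_CATEGORIES = {
    "partOfSpeech": {"noun", "verb"},
    "grammaticalNumber": {"singular", "dual", "plural"},
    "grammaticalGender": {"masculine", "feminine"},
}


def validate(features):
    """Reject any feature or value that the registry does not declare."""
    for name, value in features.items():
        if name not in DATA_CATEGORIES:
            raise ValueError(f"unknown data category: {name}")
        if value not in DATA_CATEGORIES[name]:
            raise ValueError(f"invalid value {value!r} for {name}")


@dataclass
class WordForm:
    """One inflected (full) form with its descriptors."""
    written_form: str
    features: dict = field(default_factory=dict)

    def __post_init__(self):
        validate(self.features)


@dataclass
class LexicalEntry:
    """Structural node: a lemma, its root (assumed field) and its full forms."""
    lemma: str
    root: str
    features: dict = field(default_factory=dict)
    word_forms: list = field(default_factory=list)

    def __post_init__(self):
        validate(self.features)


def group_by_root(entries):
    """Index lemmas by root, mimicking a root-based dictionary layout."""
    by_root = defaultdict(list)
    for entry in entries:
        by_root[entry.root].append(entry.lemma)
    return dict(by_root)


# Illustrative entries in transliteration (assumed sample data).
LEXICON = [
    LexicalEntry("kitāb", "k-t-b", {"partOfSpeech": "noun"}, [
        WordForm("kitāb", {"grammaticalNumber": "singular"}),
        WordForm("kutub", {"grammaticalNumber": "plural"}),
    ]),
    LexicalEntry("kātib", "k-t-b", {"partOfSpeech": "noun"}),
    LexicalEntry("dars", "d-r-s", {"partOfSpeech": "noun"}, [
        WordForm("durūs", {"grammaticalNumber": "plural"}),
    ]),
]
```

In this sketch, adding a new descriptor only means extending the registry; the entry and word form classes stay unchanged, which is the kind of independence between structure and data categories that the abstract credits to LMF.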


Similar resources

A prototype for projecting HPSG syntactic lexica towards LMF

The comparative evaluation of Arabic HPSG grammar lexica requires a deep study of their linguistic coverage. The complexity of this task results mainly from the heterogeneity of the descriptive components within those lexica (underlying linguistic resources and different data categories, for example). It is therefore essential to define more homogeneous representations, which in turn will enabl...


The Architecture Of A Standard Arabic Lexical Database: Some Figures, Ratios And Categories From The DIINAR.1 Source Program

This paper is a contribution to the issue – which has, in the course of the last decade, become critical – of the basic requirements and validation criteria for lexical language resources in Standard Arabic. The work is based on a critical analysis of the architecture of the DIINAR.1 lexical database, the entries of which are associated with grammar-lexis relations operating at word-form level ...


Mixed Strong Form Representation Particle Method for Solids and Structures

In this paper, a generalized particle system (GPS) method, a general method to describe multiple strong form representation based particle methods, is described. Gradient, divergence, and Laplacian operators used in various strong form based particle methods, such as the moving particle semi-implicit (MPS) method, smooth particle hydrodynamics (SPH), and peridynamics, can be described by the GPS metho...


A Morphological Tagger for Standard Albanian

In this paper, we present a morphological tagger for standard Albanian intended as a component of an annotation tool in the context of the Albanian Corpus Initiative. The tagger uses off-line components for generating sub-regular and irregular word forms based on the verb inflector described in Trommer (1997) and simple morphological rules for main inflectional patterns. Part of the tagger are ...


A morphological Analyzer for Standard Albanian

In this paper, we present a morphological analyzer for standard Albanian intended as a component of an annotation tool in the context of the Albanian Corpus Initiative. The analyzer uses off-line components for generating sub-regular and irregular word forms based on the verb inflector described in Trommer (1997) and simple morphological rules for main inflectional patterns. Part of the analyze...




Publication date: 2005