Proposals for a normalized representation of Standard Arabic full form lexica
نویسندگان
چکیده
Standardized lexical resources are an important prerequisite for the development of robust and wide coverage natural language processing application. Therefore, we applied the Lexical Markup Framework, a recent ISO initiative towards standards for designing, implementing and representing lexical resources, on a test bed of data for an Arabic full form lexicon. Besides minor structural accommodation that would be needed in order to take into account the traditional root-based organization of Arabic dictionaries, the LMF proposal appeared to be suitable to our purpose, especially because of the separate management of the hierarchical data structure (LMF core model) and elementary linguistic descriptors (data categories).
منابع مشابه
A prototype for projecting HPSG syntactic lexica towards LMF
The comparative evaluation of Arabic HPSG grammar lexica requires a deep study of their linguistic coverage. The complexity of this task results mainly from the heterogeneity of the descriptive components within those lexica (underlying linguistic resources and different data categories, for example). It is therefore essential to define more homogeneous representations, which in turn will enabl...
متن کاملThe Architecture Of A Standard Arabic Lexical Database: Some Figures, Ratios And Categories From The DIINAR.1 Source Program
This paper is a contribution to the issue – which has, in the course of the last decade, become critical – of the basic requirements and validation criteria for lexical language resources in Standard Arabic. The work is based on a critical analysis of the architecture of the DIINAR.1 lexical database, the entries of which are associated with grammar-lexis relations operating at word-form level ...
متن کاملMixed Strong Form Representation Particle Method for Solids and Structures
In this paper, a generalized particle system (GPS) method, a general method to describe multiple strong form representation based particle methods is described. Gradient, divergence, and Laplacian operators used in various strong form based particle method such as moving particle semi-implicit (MPS) method, smooth particle hydrodynamics (SPH), and peridynamics, can be described by the GPS metho...
متن کاملA Morphological Tagger for Standard Albanian
In this paper, we present a morphological tagger for standard Albanian intended as a component of an annotation tool in the context of the Albanian Corpus Initiative. The tagger uses off-line components for generating sub-regular and irregular word forms based on the verb inflector described in Trommer (1997) and simple morphological rules for main inflectional patterns. Part of the tagger are ...
متن کاملA morphological Analyzer for Standard Albanian
In this paper, we present a morphological analyzer for standard Albanian intended as a component of an annotation tool in the context of the Albanian Corpus Initiative. The analyzer uses off-line components for generating sub-regular and irregular word forms based on the verb inflector described in Trommer (1997) and simple morphological rules for main inflectional patterns. Part of the analyze...
متن کامل