Russian Morphological Analysis
نویسنده
چکیده
In this paper the approach to the organization of Russian inflexion morphologic model and its application for the Russian language morphological analysis and disambiguation are described. We are concerned with the pos tagging of 150-million-word Russian corpora. The approach is particularly dependent on the language processor Russicon, and on wide usage of Russicon's electronic dictionaries.
منابع مشابه
Morphological Analysis for Russian: Integration and Comparison of Taggers
In this paper we present a comparison of three morphological taggers for Russian with regard to the quality of morphological disambiguation performed by these taggers. We test the quality of the analysis in three different ways: lemmatization, POS-tagging and assigning full morphological tags. We analyze the mistakes made by the taggers, outline their strengths and weaknesses, and present a pos...
متن کاملMorphological Analyzer and Generator for Russian and Ukrainian Languages
pymorphy2 is a morphological analyzer and generator for Russian and Ukrainian languages. It uses large efficiently encoded lexicons built from OpenCorpora and LanguageTool data. A set of linguistically motivated rules is developed to enable morphological analysis and generation of out-of-vocabulary words observed in real-world documents. For Russian pymorphy2 provides state-of-the-arts morpholo...
متن کاملMorphological and AFLP-Based Genetic Diversity Assessment of Elaeagnus angustifolia L.
Genetic diversity among Russian olive genotypes in three different regions of East-Azerbaijan province (includes Tabriz, Maragheh and Malekan) were assessed using morphological and molecular (AFLP) markers. Results of the quantitative and qualitative traits statistics showed a significant genetic variation among studied germplasm and categorized them in five distinguished groups. The most numbe...
متن کاملA resource-light approach to morpho-syntactic tagging.Anna Feldman and Jirka Hana
Anna Feldman and Jirka Hana had a problem. Wanting to extract Russian verb frames, they lacked a tool for the necessary first step: morphological analysis of Russian words, disambiguated for context. To avoid the significant overhead of building a contextual-ized morphological analyzer from scratch, Feldman and Hana wondered if an analyzer that was already available for Czech would perform adeq...
متن کاملA Resource-light Approach to Russian Morphology: Tagging Russian using Czech resources
In this paper, we describe a resource-light system for the automatic morphological analysis and tagging of Russian. We eschew the use of extensive resources (particularly, large annotated corpora and lexicons), exploiting instead (i) pre-existing annotated corpora of Czech; (ii) an unannotated corpus of Russian. We show that our approach has benefits, and present what we believe to be one of th...
متن کامل