Text collections for evaluation of Russian morphological taggers

نویسندگان
چکیده

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Morphological Analysis for Russian: Integration and Comparison of Taggers

In this paper we present a comparison of three morphological taggers for Russian with regard to the quality of morphological disambiguation performed by these taggers. We test the quality of the analysis in three different ways: lemmatization, POS-tagging and assigning full morphological tags. We analyze the mistakes made by the taggers, outline their strengths and weaknesses, and present a pos...

متن کامل

Manual for postediting Russian text

The present study is a practical guide to editors who refine partially machine-translated text as a basis for linguistic analysis. The posteditors' tasks are: to code preferred English equivalents, to code English structural symbols, to resolve grammatic properties, and to code syntactic connections (dependencies). A general introduction to the field of machine translation is contained in The R...

متن کامل

Russian Morphological Analysis

In this paper the approach to the organization of Russian inflexion morphologic model and its application for the Russian language morphological analysis and disambiguation are described. We are concerned with the pos tagging of 150-million-word Russian corpora. The approach is particularly dependent on the language processor Russicon, and on wide usage of Russicon's electronic dictionaries.

متن کامل

Simulating Morphological Analyzers with Stochastic Taggers for Confidence Estimation

We propose a method for providing stochastic confidence estimates for rule-based and black-box natural language (NL) processing systems. Our method does not require labeled training data: We simply train stochastic models on the output of the original NL systems. Numeric confidence estimates enable both minimum Bayes risk–style optimization as well as principled system combination for these kno...

متن کامل

Combining POS-taggers for improved accuracy on Swedish text

Several POS-taggers are trained and tested on Swedish text. Methods to improve the accuracy of the tagging are then examined. These methods include voting, letting taggers change their voting contribution depending on how confident they are and training a new second level classifier on the output of the taggers. All these methods are more accurate than the most accurate original tagger, with 15...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of Linguistics/Jazykovedný casopis

سال: 2017

ISSN: 1338-4287,0021-5597

DOI: 10.1515/jazcas-2017-0035