Two-level Description of Turkish Morphology

نویسنده

  • Kemal Oflazer
چکیده

This paper describes a full two level morphological description of Turkish word structures The description has been implemented using the PC KIMMO environment and is based on a root word lexicon of about roots words The phonetic rules of contemporary Turkish spoken in Turkey have been encoded using two level rules while the morphotactics of the agglutinative word structures have been encoded as nite state machines for verbal nominal paradigms and other categories Almost all the special cases of and exceptions to phonological and morphological rules have been taken into account In this paper we describe the rules and the nite state machines along with examples and a discussion of how various special cases were handled We also describe some known limitations and problems with this description We then brie y describe various natural language processing applications that use this description as the morphological analysis component

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Freely Available Morphological Analyzer for Turkish

This paper presents TRmorph, a two-level morphological analyzer for Turkish. TRmorph is a fairly complete and accurate morphological analyzer for Turkish. However, strength of TRmorph is neither in its performance, nor in its novelty. The main feature of this analyzer is its availability. It has completely been implemented using freely available tools and resources, and the two-level descriptio...

متن کامل

Computer-assisted Learning of Turkish Morphology Draft 1.0; November 8; for Comments

We describe the design objectives, features, and the computational language model of a computer-mediated tool designed for learners of Turkish morphology. The underlying system is a generative grammar, more speciically, a computational word grammar that makes use of feature structures to deliver composition and decomposition of morphemes at the syntax-lexicon interface via two-level morphology....

متن کامل

Parsing Turkish with the Lexical Functional Grammar Formalism

This paper describes our work on parsing Turk-ish using the lexical-functional grammar formalism. This work represents the first effort for parsing Turkish. Our implementation is based on Tomita's parser developed at Carnegie-Mellon University Center for Machine Translation. The grammar covers a substantial subset of Turkish including simple and complex sentences, and deals with a reasonable am...

متن کامل

Turkish Language Resources: Morphological Parser, Morphological Disambiguator and Web Corpus

In this paper, we propose a set of language resources for building Turkish language processing applications. Specifically, we present a finite-state implementation of a morphological parser, an averaged perceptron-based morphological disambiguator, and compilation of a web corpus. Turkish is an agglutinative language with a highly productive inflectional and derivational morphology. We present ...

متن کامل

Root-Word Analysis of Turkish Emotional Language

This paper describes a model for the perceived emotion of Turkish sentences based on the emotions associated with the constituent words. In our model, each emotion is mapped to a point in the continuous space defined by three emotional attributes: valence, activation, and dominance. We collected a large data set through two independent surveys: a word-level survey that prompted users with emoti...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1993