Universal dependencies for Uyghur

Authors

  • Marhaba Eli
  • Weinila Mushajiang
  • Tuergen Yibulayin
  • Kahaerjiang Abiderexiti
  • Yan Liu
Abstract

The Universal Dependencies (UD) project supports cross-lingual studies of treebanks, linguistic structures, and parsing. Its goal is to create a set of multilingual, harmonized treebanks designed according to a universal annotation scheme. In this paper, we report on the conversion of the Uyghur dependency treebank to a UD version, which we term the Uyghur Universal Dependency Treebank (UyDT). We present the mapping of the Uyghur dependency treebank’s labelling scheme to the UD scheme, along with a clear description of the structural changes required in this conversion.


Similar resources

An annotation scheme for Persian based on Autonomous Phrases Theory and Universal Dependencies

A treebank is a corpus with linguistic annotations above the level of the parts of speech. During the first half of the present decade, three treebanks have been developed for Persian either originally or subsequently based on dependency grammar: Persian Treebank (PerTreeBank), Persian Syntactic Dependency Treebank, and Uppsala Persian Dependency Treebank (UPDT). The syntactic analysis of a sen...


Noisy Uyghur Text Normalization

Uyghur is the second largest and most actively used social media language in China. However, a non-negligible part of Uyghur text appearing in social media is unsystematically written with the Latin alphabet, and it continues to increase in size. Uyghur text in this format is incomprehensible and ambiguous even to native Uyghur speakers. In addition, Uyghur texts in this form lack the potential...


Uyghur Medicine in Practice: A Study in Khotan.

In Xinjiang, China, health-seeking behaviors among the Uyghur are not restricted to visitation to doctors of modern medicine because traditional Uyghur medicine is at their disposal as well. As Xinjiang's southernmost city, Khotan is a thriving center of Uyghur medicine, bolstered by a Uyghur Medical College, a marketplace specializing in the trade of Uyghur drugs, and an assemblage of skilled ...


The development of Tagged Uyghur Corpus

The history and development of the Uyghur language are introduced. After a brief introduction to the development of Uyghur words, morphology and syntax, we explain our development of a computer-aided contemporary Uyghur language tagging system. The coverage of this corpus, the resources building, the rules for syncopating and tagging etyma and termination, and the tagging of a corpus using a small ta...


Uyghur food culture.

Uyghur food culture has a long history. It is rich in resources, with the strong characteristics of being "green" and healthy, and having high nutritional value. We analyze the development and current status of Uyghur food culture, and explore the value of developing this food culture's resources. Traditional Uyghur food culture formed with influences from many ethnic groups, and has evolved in...




Publication date: 2016