Universal dependencies for Uyghur
Abstract
The Universal Dependencies (UD) project seeks to facilitate cross-lingual studies of treebanks, linguistic structures, and parsing. Its goal is to create a set of multilingual, harmonized treebanks annotated according to a universal annotation scheme. In this paper, we report on the conversion of the Uyghur dependency treebank to a UD version, which we term the Uyghur Universal Dependency Treebank (UyDT). We present the mapping of the Uyghur dependency treebank's labelling scheme to the UD scheme, along with a description of the structural changes this conversion required.
Similar resources
An annotation scheme for Persian based on Autonomous Phrases Theory and Universal Dependencies
A treebank is a corpus with linguistic annotations above the level of the parts of speech. During the first half of the present decade, three treebanks have been developed for Persian either originally or subsequently based on dependency grammar: Persian Treebank (PerTreeBank), Persian Syntactic Dependency Treebank, and Uppsala Persian Dependency Treebank (UPDT). The syntactic analysis of a sen...
Noisy Uyghur Text Normalization
Uyghur is the second largest and most actively used social media language in China. However, a non-negligible part of Uyghur text appearing in social media is unsystematically written with the Latin alphabet, and it continues to increase in size. Uyghur text in this format is incomprehensible and ambiguous even to native Uyghur speakers. In addition, Uyghur texts in this form lack the potential...
Uyghur Medicine in Practice: A Study in Khotan.
In Xinjiang, China, health-seeking behaviors among the Uyghur are not restricted to visitation to doctors of modern medicine because traditional Uyghur medicine is at their disposal as well. As Xinjiang's southernmost city, Khotan is a thriving center of Uyghur medicine, bolstered by a Uyghur Medical College, a marketplace specializing in the trade of Uyghur drugs, and an assemblage of skilled ...
The development of Tagged Uyghur Corpus
The history and development of the Uyghur language are introduced. After a brief introduction to the development of Uyghur words, morphology and syntax, we explain our development of a computer-aided tagging system for contemporary Uyghur. The coverage of this corpus, the resources building, the rules for syncopating and tagging etyma and termination, and the tagging of a corpus using a small ta...
Uyghur food culture.
Uyghur food culture has a long history. It is rich in resources, with the strong characteristics of being "green" and healthy, and having high nutritional value. We analyze the development and current status of Uyghur food culture, and explore the value of developing this food culture's resources. Traditional Uyghur food culture formed with influences from many ethnic groups, and has evolved in...