Word-Order Issues in English-to-Urdu Statistical Machine Translation
Similar resources
Word-Order Issues in English-to-Urdu Statistical Machine Translation
We investigate phrase-based statistical machine translation between English and Urdu, two Indo-European languages that differ significantly in their word-order preferences. Reordering of words and phrases is thus a necessary part of the translation process. While local reordering is modeled nicely by phrase-based systems, long-distance reordering is known to be a hard problem. We perform experi...
Model for English-Urdu Statistical Machine Translation
There are more than 60 million first-language speakers of Urdu and more than 104 million second-language speakers. Much of the knowledge on the internet that is available or useful to these Urdu speakers is in English. The typological contrast between the two languages makes the pair interesting to study for statistical machine translation. However, almost no parallel aligned data is freely available for the selected language pa...
English to Urdu Statistical Machine Translation: Establishing a Baseline
The aim of this paper is to categorize and present the existence of resources for English-to-Urdu machine translation (MT) and to establish an empirical baseline for this task. By doing so, we hope to set up a common ground for MT research with Urdu to allow for a congruent progress in this field. We build baseline phrase-based MT (PBMT) and hierarchical MT systems and report the results on 3 of...
Evaluation of English-to-Urdu Machine Translation
This paper concerns the evaluation of English-to-Urdu machine translation. Evaluation measures the quality characteristics of machine translation output and follows two approaches: human evaluation and automatic evaluation. In this paper, we concentrate mainly on human evaluation. Machine translation is an emerging research area in which human beings play a crucial role. ...
Developing English-Urdu Machine Translation Via Hindi
The paper presents a strategy for deriving English-to-Urdu translation using an English-to-Hindi MT system. The English-Hindi lexical database is used to collect all possible Hindi words and phrases. These are further augmented by including their morphological variations and attaching all possible postpositions. This list is used to provide a mapping from Hindi to Urdu. There may be a change in gender...
Journal
Journal title: The Prague Bulletin of Mathematical Linguistics
Year: 2011
ISSN: 1804-0462,0032-6585
DOI: 10.2478/v10108-011-0007-0