Solving Thematic Divergences in Machine Translation
Author
Abstract
Though most translation systems have some mechanism for translating certain types of divergent predicate-argument structures, they do not provide a general procedure that takes advantage of the relationship between lexical-semantic structure and syntactic structure. A divergent predicate-argument structure is one in which the predicate (e.g., the main verb) or its arguments (e.g., the subject and object) do not have the same syntactic ordering properties for both the source and target language. To account for such ordering differences, a machine translator must consider language-specific syntactic idiosyncrasies that distinguish a target language from a source language, while making use of lexical-semantic uniformities that tie the two languages together. This paper describes the mechanisms used by the UNITRAN machine translation system for mapping an underlying lexical-conceptual structure to a syntactic structure (and vice versa), and it shows how these mechanisms coupled with a set of general linking routines solve the problem of thematic divergence in machine translation.
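The idea of linking a shared lexical-conceptual structure to language-specific syntactic positions can be sketched in a few lines of Python. This is only a toy illustration under stated assumptions, not UNITRAN's actual implementation: the `LEXICON` table, the role names, the dictionary-based structures, and the functions `analyze`, `generate`, and `translate` are all hypothetical. The example uses the familiar English "like" versus Spanish "gustar" contrast, where the experiencer is the subject in English but the theme is the subject in Spanish.

```python
# Hypothetical per-language lexicon (an assumption for illustration).
# Each entry names the surface verb and a linking table that says which
# syntactic position realizes each thematic role in that language.
LEXICON = {
    "en": {"LIKE": {"verb": "like",
                    "linking": {"experiencer": "subject", "theme": "object"}}},
    "es": {"LIKE": {"verb": "gustar",
                    "linking": {"theme": "subject", "experiencer": "object"}}},
}


def analyze(lang, verb, subject, obj):
    """Map a syntactic structure to a language-independent structure."""
    positions = {"subject": subject, "object": obj}
    for concept, entry in LEXICON[lang].items():
        if entry["verb"] == verb:
            # Read each thematic role off the position this language assigns it.
            roles = {role: positions[pos]
                     for role, pos in entry["linking"].items()}
            return {"concept": concept, "roles": roles}
    raise KeyError(f"no lexical entry for {verb!r} in {lang!r}")


def generate(lang, lcs):
    """Map the language-independent structure back to syntactic positions."""
    entry = LEXICON[lang][lcs["concept"]]
    syntax = {"verb": entry["verb"]}
    for role, pos in entry["linking"].items():
        syntax[pos] = lcs["roles"][role]
    return syntax


def translate(src, tgt, verb, subject, obj):
    """Compose analysis in the source language with generation in the target."""
    return generate(tgt, analyze(src, verb, subject, obj))


# Arguments are abstract referent labels, not words, to keep the sketch small.
print(translate("en", "es", "like", "SPEAKER", "MARY"))
```

In this sketch the divergence is handled entirely by the per-language linking tables: the shared structure stays the same, and only the role-to-position assignment differs, so the subject and object swap when going from English to Spanish.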
Similar Resources
Identifying Semantic Divergences in Parallel Text without Annotations
Recognizing that even correct translations are not always semantically equivalent, we automatically detect meaning divergences in parallel sentence pairs with a deep neural model of bilingual semantic similarity which can be trained for any parallel corpus without any manual annotation. We show that our semantic model detects divergences more accurately than models based on surface features der...
Translation Divergences in Chinese-English Machine Translation: An Empirical Investigation
In this article, we conduct an empirical investigation of translation divergences between Chinese and English relying on a parallel treebank. To do this, we first devise a hierarchical alignment scheme where Chinese and English parse trees are aligned in a way that eliminates conflicts and redundancies between word alignments and syntactic parses to prevent the generation of spurious translatio...
Handling Structural Divergences and Recovering Dropped Arguments in a Korean/English Machine Translation System
This paper describes an approach for handling structural divergences and recovering dropped arguments in an implemented Korean to English machine translation system. The approach relies on canonical predicate-argument structures (or dependency structures), which provide a suitable pivot representation for the handling of structural divergences and the recovery of dropped arguments. It can also ...
Dealing with Linguistic Divergences in English-Bhojpuri Machine Translation
In machine translation, divergence is one of the major barriers and plays a deciding role in determining the efficiency of the system at hand. Translation divergences originate when there are structural discrepancies between the input and the output languages. They can be of various types depending on the issues being addressed, such as linguistic, cultural, communicative and so on. Owing to the...
Divergence patterns between English and Sanskrit Machine Translation
Translation divergence is a challenging problem in the area of machine translation. A detailed study of divergence issues in machine translation is required for their proper classification and detection. The language divergences between English and Sanskrit can be considered as representing the divergences between SVO (Subject Verb Object) and SOV (Subject Object Verb) classes of languages....