Better Alignments = Better Translations?
نویسندگان
چکیده
Automatic word alignment is a key step in training statistical machine translation systems. Despite much recent work on word alignment methods, alignment accuracy increases often produce little or no improvements in machine translation quality. In this work we analyze a recently proposed agreementconstrained EM algorithm for unsupervised alignment models. We attempt to tease apart the effects that this simple but effective modification has on alignment precision and recall trade-offs, and how rare and common words are affected across several language pairs. We propose and extensively evaluate a simple method for using alignment models to produce alignments better-suited for phrase-based MT systems, and show significant gains (as measured by BLEU score) in end-to-end translation systems for six languages pairs used in recent MT competitions. Comments Reprinted from: Better Alignments = Better Translations? Kuzman Ganchev, Joao Graca and Ben Taskar. In Proceedings of the 46th Annual Meeting of the Association of Computational Linguistics. Columbus, Ohio, June 16-17, 2008. This conference paper is available at ScholarlyCommons: http://repository.upenn.edu/grasp_papers/42 Better Alignments = Better Translations? Kuzman Ganchev Computer & Information Science University of Pennsylvania João V. Graça LF INESC-ID Lisboa, Portugal Ben Taskar Computer & Information Science University of Pennsylvania
منابع مشابه
Machine Translation System Combination using ITG-based Alignments
Given several systems’ automatic translations of the same sentence, we show how to combine them into a confusion network, whose various paths represent composite translations that could be considered in a subsequent rescoring step. We build our confusion networks using the method of Rosti et al. (2007), but, instead of forming alignments using the tercom script (Snover et al., 2006), we create ...
متن کاملExploiting ontologies and alignments for trust in semantic P2P networks
In a semantic P2P network, peers use separate ontologies and rely on alignments between their ontologies for translating queries. However, alignments may be limited —unsound or incomplete— and generate flawed translations, and thereby produce unsatisfactory answers. In this paper we propose a trust mechanism that can assist peers to select those in the network that are better suited to answer t...
متن کاملImproving Semantic SMT via Soft Semantic Role Label Constraints on ITG Alignments
We show that applying semantic role label constraints to bracketing ITG alignment to train MT systems improves the quality of MT output in comparison to the conventional BITG and GIZA alignments. Moreover, we show that applying soft constraints to SRL-constrained BITG alignment leads to a better translation system compared to using hard constraints which appear too harsh to produce meaningful b...
متن کاملMatt: Local Flexibility Aids Protein Multiple Structure Alignment
Even when there is agreement on what measure a protein multiple structure alignment should be optimizing, finding the optimal alignment is computationally prohibitive. One approach used by many previous methods is aligned fragment pair chaining, where short structural fragments from all the proteins are aligned against each other optimally, and the final alignment chains these together in geome...
متن کاملThe Regression Model of Machine Translation
Machine translation is the task of automatically nding the translation of a source sentence in the target language. Statistical machine translation (SMT) use parallel corpora or bilingual paired corpora that are known to be translations of each other to nd a likely translation for a given source sentence based on the observed translations. The task of machine translation can be seen as an insta...
متن کامل