Tag-less back-translation

Abstract

An effective method to generate a large number of parallel sentences for training improved neural machine translation (NMT) systems is the use of back-translations of target-side monolingual data. Standard back-translation has been shown to be unable to efficiently utilize the huge amount of existing monolingual data because of the inability of translation models to differentiate between authentic and synthetic parallel data during training. Tagging, or using gates, has been used to enable translation models to distinguish between synthetic and authentic data, improving standard back-translation and also enabling the use of iterative back-translation on language pairs that underperformed with standard back-translation. In this work, we approach back-translation as a domain adaptation problem, eliminating the need for explicit tagging. In this approach, called tag-less back-translation, the synthetic and authentic parallel data are treated as out-of-domain and in-domain data respectively and, through pre-training and fine-tuning, the translation model is shown to be able to learn more efficiently from them during training. Experimental results show that the approach outperforms the standard and tagged back-translation approaches on low-resource English-Vietnamese and English-German neural machine translation.
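The contrast between tagged and tag-less back-translation can be sketched at the data-preparation level. The following Python is a minimal illustration, not the authors' implementation: the function names, the `<BT>` tag string, and the `backward` and `train_step` callables are all hypothetical, and the stage order (pre-train on synthetic data, then fine-tune on authentic data) assumes the usual domain-adaptation recipe of pre-training on out-of-domain data and fine-tuning on in-domain data.

```python
from typing import Callable, List, Tuple

Pair = Tuple[str, str]  # (source sentence, target sentence)


def back_translate(target_mono: List[str],
                   backward: Callable[[str], str]) -> List[Pair]:
    """Pair each target-side monolingual sentence with a synthetic source.

    `backward` stands in for a target-to-source translation model
    (hypothetical; any callable mapping str -> str works here).
    """
    return [(backward(t), t) for t in target_mono]


def tagged_corpus(authentic: List[Pair], synthetic: List[Pair],
                  tag: str = "<BT>") -> List[Pair]:
    """Tagged back-translation: one mixed corpus in which every synthetic
    source sentence is prefixed with a tag (the tag string is an assumption)."""
    return list(authentic) + [(f"{tag} {s}", t) for s, t in synthetic]


def tagless_schedule(authentic: List[Pair],
                     synthetic: List[Pair]) -> List[Tuple[str, List[Pair]]]:
    """Tag-less back-translation as domain adaptation: the data are kept
    apart by training stage instead of by a tag. Synthetic pairs play the
    out-of-domain role, authentic pairs the in-domain role."""
    return [("pretrain", list(synthetic)), ("finetune", list(authentic))]


def run_schedule(schedule: List[Tuple[str, List[Pair]]],
                 train_step: Callable[[str, str, str], None]) -> None:
    """Feed each stage's pairs, in order, to a caller-supplied training step."""
    for stage, pairs in schedule:
        for src, tgt in pairs:
            train_step(stage, src, tgt)


if __name__ == "__main__":
    authentic = [("xin chào", "hello")]
    synthetic = back_translate(["good morning"], lambda t: f"src({t})")
    run_schedule(tagless_schedule(authentic, synthetic),
                 lambda stage, src, tgt: print(stage, "|", src, "->", tgt))
```

In the tagged variant the model sees both kinds of data in one pass and relies on the tag to tell them apart; in the tag-less variant the separation is carried entirely by the order of the two training stages, so the source sentences stay unmodified.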

Similar resources

Efficient Tag Path Authentication Protocol with Less Tag Memory

Logistical management has been advanced rapidly in these years, taking advantage of the broad connectivity of the Internet. As it becomes an important part of our lives, it also raises many challenging issues, e.g., the counterfeits of expensive goods pose a serious threat to supply chain management. As a result, path authentication becomes especially important in supply chain management, as it...

"Difficult back", turns into "less difficult back" by ultrasonography

Corresponding author: Yoon-Hee Kim, M.D., Ph.D., Department of Anesthesiology and Pain Medicine, College of Medicine, Chungnam National University, Munhwa-ro, Jung-gu, Daejeon 301-721, Korea. Tel: 82-42-280-7840, Fax: 82-42-280-7968, E-mail: [email protected] This is an open-access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http:// creative...

A Kinect-less Augmented Reality Approach to Real-time Tag-less Virtual Trial Room Simulation

The Virtual Trial Room (VTR) application software simulates an apparel dressing room by the implementation of a virtual mirror, portraying an augmented view of the user with virtual superimposed clothes. Traditional approach to the design and implementation of virtual dressing rooms have been wildly using either normal webcams with Tag/Marker based tracking or expensive 3D depth & motion sensin...

Translation and back-translation in qualitative nursing research: methodological review.

AIMS To examine the effects of the procedure of translation and the techniques used on the collection and interpretation of original language qualitative data for English presentation. BACKGROUND Nursing and health research increasingly use qualitative research for a broadened perspective on practice and research. In numerous qualitative nursing research papers, data are collected in the orig...

Back-Translation for Discovering Distant Protein Homologies

Frameshift mutations in protein-coding DNA sequences produce a drastic change in the resulting protein sequence, which prevents classic protein alignment methods from revealing the proteins’ common origin. Moreover, when a large number of substitutions are additionally involved in the divergence, the homology detection becomes difficult even at the DNA level. To cope with this situation, we pro...

Journal

Journal title: Machine Translation

Year: 2021

ISSN: 0922-6567, 1573-0573

DOI: https://doi.org/10.1007/s10590-021-09284-y