Domain Adaptation and Multi-Domain Adaptation for Neural Machine Translation: A Survey
نویسندگان
چکیده
The development of deep learning techniques has allowed Neural Machine Translation (NMT) models to become extremely powerful, given sufficient training data and time. However, systems struggle when translating text from a new domain with distinct style or vocabulary. Fine-tuning on in-domain allows good adaptation, but requires relevant bilingual data. Even if this is available, simple fine-tuning can cause overfitting catastrophic forgetting previously learned behaviour. We survey approaches adaptation for NMT, particularly where system may need translate across multiple domains. divide into those revolving around selection generation, model architecture, parameter procedure, inference procedure. finally highlight the benefits multidomain other lines NMT research.
منابع مشابه
Fast Domain Adaptation for Neural Machine Translation
Neural Machine Translation (NMT) is a new approach for automatic translation of text from one human language into another. The basic concept in NMT is to train a large Neural Network that maximizes the translation performance on a given parallel corpus. NMT is gaining popularity in the research community because it outperformed traditional SMT approaches in several translation tasks at WMT and ...
متن کاملMulti-Domain Neural Machine Translation through Unsupervised Adaptation
We investigate the application of Neural Machine Translation (NMT) under the following three conditions posed by realworld application scenarios. First, we operate with an input stream of sentences coming from many different domains and with no predefined order. Second, the sentences are presented without domain information. Third, the input stream should be processed by a single generic NMT mo...
متن کاملSample-oriented Domain Adaptation for Image Classification
Image processing is a method to perform some operations on an image, in order to get an enhanced image or to extract some useful information from it. The conventional image processing algorithms cannot perform well in scenarios where the training images (source domain) that are used to learn the model have a different distribution with test images (target domain). Also, many real world applicat...
متن کاملInstance Weighting for Neural Machine Translation Domain Adaptation
Instance weighting has been widely applied to phrase-based machine translation domain adaptation. However, it is challenging to be applied to Neural Machine Translation (NMT) directly, because NMT is not a linear model. In this paper, two instance weighting technologies, i.e., sentence weighting and domain weighting with a dynamic weight learning strategy, are proposed for NMT domain adaptation...
متن کاملCost Weighting for Neural Machine Translation Domain Adaptation
In this paper, we propose a new domain adaptation technique for neural machine translation called cost weighting, which is appropriate for adaptation scenarios in which a small in-domain data set and a large general-domain data set are available. Cost weighting incorporates a domain classifier into the neural machine translation training algorithm, using features derived from the encoder repres...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Artificial Intelligence Research
سال: 2022
ISSN: ['1076-9757', '1943-5037']
DOI: https://doi.org/10.1613/jair.1.13566