ACL2021

A Comparison between Pre-training and Large-scale Back-translation for Neural Machine Translation

Dandan Huang, Kun Wang, Yue Zhang

Abstract

We present iterative back-translation, a method for generating increasingly better synthetic parallel data from monolingual data to train neural machine translation systems. Our proposed method is very simple yet effective and highly applicable in practice. We demonstrate improvements in neural machine translation quality in both high and low resourced scenarios, including the best reported BLEU scores for the WMT 2017 German↔English tasks.