Researchers working on machine translation (MT) often rely on back translation to beef up training data. Back translation — when more widely available monolingual target language data is translated into the source language — was credited with enabling Transformer-based deep-learning system CUBITT to “outperform human-level translation” as covered by Slator in September.
Back translation was also crucial to a method for detecting machine translated content, which may become more critical as startups ramp up commercialization of AI-powered text generation.
The usefulness of back translation depends on the widespread availability of target language data, which can present a hurdle for languages of lesser diffusion.

