TikTok Parent Company ByteDance Open Sources Neural Speech Translation Toolkit

Recent work by the three authors (Chengqi Zhao, Mingxuan Wang, and Lei Li, all of ByteDance) includes PRUNE-TUNE, a new method of domain adaptation for machine translation (MT), and multi-resolutional (MR) Doc2Doc, which the researchers used to train a neural sequence-to-sequence MT model for document-level translation.

As the paper explained, one shortcoming of traditional “cascade” speech translation systems is that mistakes in transcription, a step typically handled by automatic speech recognition, carry over into the translation. End-to-end speech translation, on the other hand, bypasses the transcription step and reduces lag time.

The authors noted that studies on speech translation work on different datasets; their goal, therefore, was to establish reproducible and reliable benchmarks for the field. They said that NeurST’s “straightforward recipes for preprocessing audio datasets” will free up developers for more advanced work on speech translation.

NeurST was put to the test on several benchmark speech translation tasks for eight European language pairs using publicly available speech translation data (namely the Augmented LibriSpeech and MuST-C corpora).

Overall, NeurST outperformed existing counterparts ESPnet-ST and fairseq-ST in most languages. The authors hope the toolkit, which is meant to be NLP researcher-friendly, will be used to establish baselines in future studies.
