Recent work by the three authors — Chengqi Zhao, Mingxuan Wang, and Lei Li, all of ByteDance — includes PRUNE-TUNE, a new method of domain adaptation for machine translation (MT) domain adaptation, and multi-resolutional (MR) Doc2Doc, which the researchers used to train a neural sequence-to-sequence MT model for document-level translation.
Pro Guide: Sales and Marketing for Language Service Providers
36 pages. How LSPs generate leads, hire and compensate Sales staff, succeed in Digital Marketing, and benchmark against rivals.
As the paper explained, one of the shortcomings of traditional “cascade” speech translation systems is that mistakes in transcription — typically powered by automatic speech recognition — can cause errors in translation. End-to-end speech translation, on the other hand, bypasses the transcription step and produces less lag time.
The authors noted that studies on speech translation work on different datasets; their goal, therefore, was to establish reproducible and reliable benchmarks for the field. They said that NeurST’s “straightforward recipes for preprocessing audio datasets” will free up developers for more advanced work on speech translation.
Slator 2021 Data-for-AI Market Report
44-pages on how LSPs enter and scale in AI Data-as-a-service. Market overview, AI use cases, platforms, case studies, sales insights.
NeurST was put to the test on several benchmark speech translation tasks for eight European language pairs using publicly available speech translation data (namely the Augmented LibriSpeech and MuST-C corpora).
Overall, NeurST outperformed existing counterparts Espnet-ST and fairseq-ST in most languages. The authors hope the toolkit, which is meant to be NLP researcher-friendly, will be used to establish baselines in future studies.