Machine translation quality estimation (MTQE) is firmly in the spotlight as big tech is experimenting with ways to auto-evaluate machine translation output. The focus on MTQE is justified. Leveraging LLMs to analyze machine translation output, and to predict which segments need human review, gives language service providers (LSPs) and internal localization teams managing large content volumes the opportunity to add significant value to existing machine translation workflows.
LSPs and localization teams are faced with a growing demand for machine translation services with ever-shorter turnaround times. Understanding which documents provide better or worse machine translation output, and the effort involved to post-edit, is still a manual task for both LSPs, localization teams, and linguists.
With LLMs, assessing machine translation output automatically is now within reach, although it is not without its complications. At this year’s SlatorCon, a panel of experts discussed how LLMs are still “off-the-shelf starting points” that will need to be fine-tuned with the help of translators.

