The Case for Contextual Evaluation in Measuring LLM Translation Quality
At SlatorCon Zurich, Dr. Sheila Castilho emphasizes the significance of contextual evaluation in assessing large...
MATEO aims to open up machine translation evaluation, making it accessible to more stakeholders and...
As machine translation gets better, the problem of bias — especially gender bias — remains...
Social media giant Meta proposes new metric to enable more consistency in human evaluation of...
To be fair, this is the first-ever multilingual model to win the international machine translation...
Researchers tracked CO2 emissions from training German, French, and English machine translation engines, and found...
Combing through papers released from 2010 to 2020, researchers identify overreliance on the BLEU metric to...
Facebook, Naver, and George Mason University release open-source metric based on Covid-19 terms; hope to...
Scientists propose new system to standardize automatic evaluation of NLP systems by adding humans in...
Though beam search can boost BLEU scores, it can also lead to high rates of...
The most-cited neural machine translation research papers show how NMT came to dominate the field...