Google Launches MedASR, an Open Medical Speech-to-Text Model
Google introduces MedASR, an open-weight medical speech-to-text model positioned as a foundational layer for healthcare...
Meta AI rolls out Omnilingual ASR, an open-source automatic speech recognition suite covering 1,600+ languages...
Hugging Face, NVIDIA, Mistral AI, and the University of Cambridge launch the Open ASR Leaderboard,...
In just a few weeks, Alibaba’s Qwen team has rolled out models for automatic speech...
Alibaba’s Qwen team unveils a new speech recognition model covering 11 languages and multiple accents,...
Stanford and UC Santa Cruz launch a benchmark for audio-language models, with Google’s Gemini 2.5...
At IWSLT 2025, researchers and industry share ways to boost speed, quality, subtitles, model size,...
MiniMax is one of China’s “Six Tigers” — six AI firms at the top of...
A new Google study finds that multilingual speech datasets suffer from serious data quality issues...
IIT Bombay researchers propose a new approach to speech-to-speech translation that not only translates speech...
Inspired Spine’s SURI converts multilingual doctor-patient conversations into structured medical reports using automatic speech recognition...