#automatic speech recognition

Array ( [post_type] => post [posts_per_page] => 12 [paged] => 2 [post_status] => publish [tax_query] => Array ( [relation] => AND [0] => Array ( [taxonomy] => post_tag [field] => term_id [terms] => 16399 [operator] => IN ) ) )

Google Launches MedASR, an Open Medical Speech-to-Text Model

Google introduces MedASR, an open-weight medical speech-to-text model positioned as a foundational layer for healthcare...

What is Meta’s Omnilingual ASR?

Meta AI rolls out Omnilingual ASR, an open-source automatic speech recognition suite covering 1,600+ languages...

NVIDIA, Microsoft, ElevenLabs Top New Automatic Speech Recognition Leaderboard

Hugging Face, NVIDIA, Mistral AI, and the University of Cambridge launch the Open ASR Leaderboard,...

Alibaba Triples Down on Speech, Translation, Multimodal AI, New Model Launches Show

In just a few weeks, Alibaba’s Qwen team has rolled out models for automatic speech...

Alibaba’s New Speech Recognition Model Pushes Accuracy But Keeps Weights Closed

Alibaba’s Qwen team unveils a new speech recognition model covering 11 languages and multiple accents,...

Stanford and UC Santa Cruz Launch Benchmark for Audio-Language Models

Stanford and UC Santa Cruz launch a benchmark for audio-language models, with Google’s Gemini 2.5...

Inside IWSLT 2025: Key Takeaways for AI Live Speech Translation and Subtitling

At IWSLT 2025, researchers and industry share ways to boost speed, quality, subtitles, model size,...

Multilingual Foundation AI Provider MiniMax Seeks USD 500M IPO

MiniMax is one of China’s “Six Tigers” — six AI firms at the top of...

Google Flags Serious Data Quality Issues in Public Multilingual Speech Datasets

A new Google study finds that multilingual speech datasets suffer from serious data quality issues...

IIT Bombay Explores Accent-Aware Speech Translation

IIT Bombay researchers propose a new approach to speech-to-speech translation that not only translates speech...

US Healthcare Firm Converts Multilingual Speech into Structured Medical Reports

Inspired Spine’s SURI converts multilingual doctor-patient conversations into structured medical reports using automatic speech recognition...

New Research Tackles Key Challenges in AI Speech Recognition and Translation

Tsinghua and Cambridge researchers introduce a technique that expands multilingual speech models without full retraining,...

1 2 3 4 5
0
    0
    Your Cart
    Your cart is empty
    Privacy Overview

    This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.