speech-to-text

Speech & voice

New Speech-to-Text Models, Updates, and Benchmarks to Watch

Recent speech-to-text updates span model launches, product updates, and new benchmarks.

Speech & voice

Gladia Pushes Transparency in Speech AI Benchmarking

With new benchmarks and tools, Gladia is pushing transparency in speech AI evaluation — while...

Speech & voice

Microsoft Rolls Out New Speech Models in Push Toward First-Party AI Stack

Microsoft unveils in-house transcription and speech synthesis models, expanding its first-party speech AI stack.

Speech & voice

AssemblyAI Pushes into Healthcare with Medical Mode for Clinical Transcription

AssemblyAI launches Medical Mode, a domain-specific speech-to-text add-on designed to improve recognition of medical terminology...

Features

Is Google Meet Live Translation Ready for Prime Time?

Revenue in the millions, valuation in the billions: readers vote on the ElevenLabs valuation megajump,...

Speech & voice

Prompt-Based Control Reaches Enterprise Speech-to-Text

Recent releases from Microsoft and AssemblyAI reflect growing interest in structured, configurable speech recognition as...

Speech & voice

Microsoft Unveils VibeVoice-ASR for Long-Form, Multi-Speaker Transcription

Microsoft releases VibeVoice-ASR, an open-source speech-to-text model designed for long-form audio, structured transcription, and customised...

Investment & funding

Multilingual AI Voice LTP Deepgram Raises USD 130M in Series C

The US-based company told Slator about the roadmap for its multilingual AI models, AI funding...

Speech & voice

NVIDIA Doubles Down on Open Speech AI

NVIDIA doubles down on open speech AI with ultra-low-latency automatic speech recognition and multilingual text-to-speech...

Speech & voice

What is Meta’s Omnilingual ASR?

Meta AI rolls out Omnilingual ASR, an open-source automatic speech recognition suite covering 1,600+ languages...

Speech & voice

Meet Whispering, an Open‑Source, Local‑First Transcription App

Free, open-source transcription app Whispering lets users keep recordings on-device or connect directly to providers...

Language AI

New Research Tackles Key Challenges in AI Speech Recognition and Translation

Tsinghua and Cambridge researchers introduce a technique that expands multilingual speech models without full retraining,...

#speech-to-text

New Speech-to-Text Models, Updates, and Benchmarks to Watch

Gladia Pushes Transparency in Speech AI Benchmarking

Microsoft Rolls Out New Speech Models in Push Toward First-Party AI Stack

AssemblyAI Pushes into Healthcare with Medical Mode for Clinical Transcription

Is Google Meet Live Translation Ready for Prime Time?

Prompt-Based Control Reaches Enterprise Speech-to-Text

Microsoft Unveils VibeVoice-ASR for Long-Form, Multi-Speaker Transcription

Multilingual AI Voice LTP Deepgram Raises USD 130M in Series C

NVIDIA Doubles Down on Open Speech AI

What is Meta’s Omnilingual ASR?

Meet Whispering, an Open‑Source, Local‑First Transcription App

New Research Tackles Key Challenges in AI Speech Recognition and Translation