New Speech-to-Text Models, Updates, and Benchmarks to Watch
Recent speech-to-text updates span model launches, product updates, and new benchmarks.
Recent speech-to-text updates span model launches, product updates, and new benchmarks.
With new benchmarks and tools, Gladia is pushing transparency in speech AI evaluation — while...
Microsoft unveils in-house transcription and speech synthesis models, expanding its first-party speech AI stack.
AssemblyAI launches Medical Mode, a domain-specific speech-to-text add-on designed to improve recognition of medical terminology...
Revenue in the millions, valuation in the billions: readers vote on the ElevenLabs valuation megajump,...
Recent releases from Microsoft and AssemblyAI reflect growing interest in structured, configurable speech recognition as...
Microsoft releases VibeVoice-ASR, an open-source speech-to-text model designed for long-form audio, structured transcription, and customised...
The US-based company told Slator about the roadmap for its multilingual AI models, AI funding...
NVIDIA doubles down on open speech AI with ultra-low-latency automatic speech recognition and multilingual text-to-speech...
Meta AI rolls out Omnilingual ASR, an open-source automatic speech recognition suite covering 1,600+ languages...
Free, open-source transcription app Whispering lets users keep recordings on-device or connect directly to providers...