Accent Leakage and Flat Voices: How Amazon Is Making AI Voices More ExpressiveÂ
Amazon notes that accent inconsistency, flat speech, and reliability issues still limit AI voice and...
Amazon notes that accent inconsistency, flat speech, and reliability issues still limit AI voice and...
Microsoft unveils in-house transcription and speech synthesis models, expanding its first-party speech AI stack.
Mistral launches Voxtral TTS, adding speech generation to its Voxtral model family and enabling end-to-end...
A busy start to 2026 for voice AI company Hume AI, with leadership changes, new...
AI text-to-speech company WellSaid takes on venture debt to fund enterprise sales push in crowded...
Researchers argue that text-to-speech evaluation under-tests what matters in real-world deployment — and propose a...
Alibaba’s Qwen team releases a new set of open-source speech models under the Qwen3 family,...
A new cross-lingual voice cloning track at IWSLT 2026 highlights changing priorities in multilingual speech...
NVIDIA doubles down on open speech AI with ultra-low-latency automatic speech recognition and multilingual text-to-speech...
Resemble AI releases an open-source text-to-speech model designed for real-time, expressive voice generation and positioned...
At its annual Adobe MAX 2025 conference, Adobe announced a new feature for its Firefly...