Speech & voice

Dubbing and subtitling news from streaming, gaming, etc.

Voice Cloning Meets Emotional Speech Synthesis With Alibaba’s Marco-Voice Model

Alibaba’s Marco-Voice combines voice cloning with controllable emotion, delivering more natural and expressive synthetic speech...

Stanford and UC Santa Cruz Launch Benchmark for Audio-Language Models

Stanford and UC Santa Cruz launch a benchmark for audio-language models, with Google’s Gemini 2.5...

Microsoft Research Unveils VibeVoice for Long-Form Speech Synthesis

Microsoft’s VibeVoice is an open-source text-to-speech model that generates podcast-length audio with up to four...

Meet Whispering, an Open‑Source, Local‑First Transcription App

Free, open-source transcription app Whispering lets users keep recordings on-device or connect directly to providers...

Google Translate Improves AI Live Speech Translation Feature

The company now enables users to have both consecutive and real-time conversations in over 70...

Inside IWSLT 2025: Key Takeaways for AI Live Speech Translation and Subtitling

At IWSLT 2025, researchers and industry share ways to boost speed, quality, subtitles, model size,...

AI Live Speech Translation with Voice Cloning from ByteDance

ByteDance introduces a product-ready AI live speech translation system that delivers near-human accuracy, real-time voice...

Mistral Debuts Open-Source Voxtral for AI Speech Translation and Transcription

Mistral releases Voxtral, a new family of open-source models for AI speech translation and transcription,...

Multilingual Foundation AI Provider MiniMax Seeks USD 500M IPO

MiniMax is one of China’s “Six Tigers” — six AI firms at the top of...

Microsoft Says This New Voice Conversion Feature Will Improve AI Dubbing

Microsoft introduces a voice conversion feature in Azure AI Speech, allowing users to transform recorded...

Google Flags Serious Data Quality Issues in Public Multilingual Speech Datasets

A new Google study finds that multilingual speech datasets suffer from serious data quality issues...

AI Transcription in Healthcare Faces Regulatory Scrutiny

A short time after encouraging its use among healthcare practitioners, NHS England has asked them...

OpenAI, a ‘Silly’ Lawsuit, and AI Live Speech Translation as a ‘Killer App’

OpenAI’s Altman called a trademark lawsuit by iyO “silly,” citing iyO’s past “babelfish” app collab...

Voice Actors to Vote on New Video Game Agreement Regarding AI Speech Technology

The deal, negotiated by the Screen Actors Guild-American Federation of Television and Radio Artists, includes...

Swiss Researchers Adapt Voice Cloning to Swiss German Dialects

Researchers at ZHAW fine-tuned a multilingual text-to-speech model on nearly 5,000 hours of podcast audio...
