#speech-to-text

Array ( [post_type] => post [posts_per_page] => 12 [paged] => 1 [post_status] => publish [tax_query] => Array ( [relation] => AND [0] => Array ( [taxonomy] => post_tag [field] => term_id [terms] => 1593 [operator] => IN ) ) )

New Speech-to-Text Models, Updates, and Benchmarks to Watch

Recent speech-to-text updates span model launches, product updates, and new benchmarks.

Gladia Pushes Transparency in Speech AI Benchmarking

With new benchmarks and tools, Gladia is pushing transparency in speech AI evaluation — while...

Microsoft Rolls Out New Speech Models in Push Toward First-Party AI Stack

Microsoft unveils in-house transcription and speech synthesis models, expanding its first-party speech AI stack.

AssemblyAI Pushes into Healthcare with Medical Mode for Clinical Transcription

AssemblyAI launches Medical Mode, a domain-specific speech-to-text add-on designed to improve recognition of medical terminology...

Is Google Meet Live Translation Ready for Prime Time?

Revenue in the millions, valuation in the billions: readers vote on the ElevenLabs valuation megajump,...

Prompt-Based Control Reaches Enterprise Speech-to-Text

Recent releases from Microsoft and AssemblyAI reflect growing interest in structured, configurable speech recognition as...

Microsoft Unveils VibeVoice-ASR for Long-Form, Multi-Speaker Transcription

Microsoft releases VibeVoice-ASR, an open-source speech-to-text model designed for long-form audio, structured transcription, and customised...

Multilingual AI Voice LTP Deepgram Raises USD 130M in Series C

The US-based company told Slator about the roadmap for its multilingual AI models, AI funding...

NVIDIA Doubles Down on Open Speech AI

NVIDIA doubles down on open speech AI with ultra-low-latency automatic speech recognition and multilingual text-to-speech...

What is Meta’s Omnilingual ASR?

Meta AI rolls out Omnilingual ASR, an open-source automatic speech recognition suite covering 1,600+ languages...

Meet Whispering, an Open‑Source, Local‑First Transcription App

Free, open-source transcription app Whispering lets users keep recordings on-device or connect directly to providers...

New Research Tackles Key Challenges in AI Speech Recognition and Translation

Tsinghua and Cambridge researchers introduce a technique that expands multilingual speech models without full retraining,...

1 2 3 4
0
    0
    Your Cart
    Your cart is empty
    Privacy Overview

    This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.