The “Realtime API,” designed to enable developers to build speech-to-speech experiences, was also introduced in October 2024, and new voices were added in April 2025.
Live Speech Translation
AVM with a live speech translation flow in ChatGPT has some users already sounding off on X about its strengths, like @JeffreyJonah5, who commented “This changes everything if you’re traveling abroad. ChatGPT can now stay in translation mode no reset needed, just talk. The new voice sounds way more human. More emotion, better pacing. It’s getting scary good.”
Slator 2025 Language Industry Market Report
The 150-page report offers a comprehensive view of the 2025 global market — with market sizing, AI capability breakdowns, buyer insights, use cases, survey data, and projections through 2030.
The upgraded AVM is now available to all paid ChatGPT users in the Plus, Teams, Enterprise, and Edu paid tiers. Users can access the feature by tapping the Voice icon within the message composer.
While this is an important update to the AVM feature, OpenAI does acknowledge some known limitations, including occasional minor decreases in audio quality and infrequent hallucinations that produce unintended sounds, like ads or just gibberish.
At the time of publication, Slator was able to corroborate the AVM translation feature works only on the mobile interface, and that it switches between language learner mode and conversation mode even after prompting it for translation — as reported by some early users on X — and that it sometimes simply does not translate and stays silent.
OpenAI states that it is actively working to resolve these issues. In the meantime, the company has updated its Frequently Asked Questions site with more feature information.