OpenAI prepares bidirectional voice mode for rollout on ChatGPT (2 minute read)

TLDR AI Models

Summary

OpenAI is rolling out a new bidirectional voice model (Bidi 1) for ChatGPT that allows simultaneous speaking, hearing, and listening, real-time translation, and improved conversation context handling. The upgrade is appearing in the web interface and app for some users, with a broader release expected soon.

OpenAI has started rolling out Bidirectional Voice Mode for ChatGPT. The company's new audio generation model, Bidi 1, lets the assistant speak, hear, and listen at the same time. It is able to hold the thread of a whole conversation and switch tasks on the fly if interrupted. The model can sing and beatbox, but there are some tight copyright restrictions. OpenAI has yet to make a formal announcement about the model, but some users are already seeing it in their model selectors.
Original Article
View Cached Full Text

Cached at: 06/24/26, 01:43 PM

# OpenAI prepares bidirectional voice mode for rollout Source: [https://www.testingcatalog.com/openai-prepares-bidirectional-voice-mode-for-rollout-on-chatgpt/](https://www.testingcatalog.com/openai-prepares-bidirectional-voice-mode-for-rollout-on-chatgpt/) [![Google Preferred Source](https://www.testingcatalog.com/assets/images/google_preferred_source_badge_light_en.png?v=d81cbce2e9)](https://google.com/preferences/source?q=testingcatalog.com) OpenAI looks set to hand ChatGPT's voice mode its biggest upgrade in months, with a next\-generation audio model surfacing as[Bidi 1](https://www.testingcatalog.com/openai-prepares-major-chatgpt-voice-upgrade-with-gpt-bidi-1/), shorthand for the bidirectional design that lets the assistant speak, hear, and listen at once\. References to it began appearing in the ChatGPT web interface ahead of a possible release this week, and it has already begun reaching a subset of users in the app\. > BREAKING 🔥: First tests of "Bidi 1", an upcoming bidirectional voice model from OpenAI\. This upgrade will arrive in ChatGPT and, potentially, in Codex soon as well\. \> Bidi 1 can speak over while you are talking and keep listening\. \> Bidi 1 can switch between tasks back and…[https://t\.co/BwWhCKx3G0](https://t.co/BwWhCKx3G0?ref=testingcatalog.com)[pic\.twitter\.com/Fawc74kBym](https://t.co/Fawc74kBym?ref=testingcatalog.com) — 🚨 AI News \| TestingCatalog \(@testingcatalog\)[June 23, 2026](https://x.com/testingcatalog/status/2069331697615749530?ref_src=twsrc%5Etfw&ref=testingcatalog.com) In our early testing, the gap from today's advanced voice mode is plain\. Bidi 1 sits in the model selector under settings, beside the standard and advanced options, and turns the voice bubble yellow once picked\. It offers small, natural acknowledgments — an "okay" or a brief nod — when you pause or slow down, without cutting across you\. It also switches tasks on the fly: ask it to count to ten, interrupt to reverse the count, and it adjusts immediately\. > OPENAI 🔥: An upcoming Bidi 1 voice model will be able to translate in real\-time\! This will unlock a huge pile of use cases to be built on top of when it lands on the APIs\.[pic\.twitter\.com/95sRnSzJfs](https://t.co/95sRnSzJfs?ref=testingcatalog.com) — 🚨 AI News \| TestingCatalog \(@testingcatalog\)[June 23, 2026](https://x.com/testingcatalog/status/2069351216648204757?ref_src=twsrc%5Etfw&ref=testingcatalog.com) More usefully, it holds the thread of a whole conversation rather than dropping earlier context, the weak point that has long dogged the current voice stack, and it no longer jumps in during longer pauses\. ![ChatGPT](https://storage.ghost.io/c/2a/1b/2a1b1782-8506-4d7d-bf53-ad3fb52e2a0f/content/images/2026/06/ChatGPT-06-23-2026_02_41_AM.jpg)Creative behavior carries over from the first advanced voice rollout, singing and beatboxing included, though copyright handling is tighter; it declines popular songs outright while still attempting an original piece in a chosen artist's style\. The move reads as[OpenAI](https://www.testingcatalog.com/tag/chatgpt/)closing the distance between its capable text models and an older voice layer, treating conversation as a core route into ChatGPT\. The company has not formally announced it\. A gradual, opt\-in release across web and mobile looks likely, with the European Economic Area possibly waiting longer \(not confirmed\)\. Codex appears set for its own voice upgrade in the weeks after this launch, separate from it, and API access may follow later still \(timeline is not confirmed\)\.

Similar Articles

@FinanceYF5: OpenAI's new voice model Bidi 1 first test exposure - Bidirectional voice design: while you speak, it listens; you can interrupt mid-way to switch tasks immediately, no longer grabbing the conversation when you pause. It also supports real-time translation, and context memory is much stronger than the current Advanced Voice. It's now being pushed to a small group, ChatGPT …

X AI KOLs Following

OpenAI's new voice model Bidi 1 first test exposure, supports bidirectional voice design, real-time translation, and stronger context memory, currently being pushed to a small group on ChatGPT.

ChatGPT can now see, hear, and speak

OpenAI Blog

OpenAI is rolling out new voice and image capabilities to ChatGPT Plus and Enterprise users, enabling users to have voice conversations and share images for multimodal interactions powered by GPT-3.5/GPT-4 and custom text-to-speech models.