Apparently an example of the upcoming GPT bidirectional voice model
Summary
An example of the upcoming GPT bidirectional voice model has been shown.
Similar Articles
OpenAI prepares major ChatGPT voice upgrade with GPT-Bidi-1 (2 minute read)
OpenAI is preparing to release GPT-Bidi-1, a next-generation voice model for ChatGPT that supports bidirectional communication, interruptions, and mid-sentence adjustments, aiming to close the gap between voice and text capabilities.
OpenAI plans to release GPT-Bidi-1, its next-generation voice model
OpenAI plans to release GPT-Bidi-1, its next-generation voice model that can listen and speak simultaneously, handle interruptions, and enable more natural conversations.
Advancing voice intelligence with new models in the API
OpenAI has announced three new voice models in its API: GPT-Realtime-2 with advanced reasoning, GPT-Realtime-Translate for live multilingual translation, and GPT-Realtime-Whisper for streaming transcription, aiming to enable more natural and action-oriented voice applications.
@VraserX: What are you more excited for from OpenAI? GPT 5.6, if the rumors are real, or BiDi voice mode? BiDi sounds wild. Bidir…
A user asks which upcoming OpenAI feature is more exciting: the rumored GPT-5.6 model or bidirectional voice mode (BiDi), which allows real-time simultaneous listening and speaking.
ChatGPT voice mode is a weaker model
ChatGPT's voice mode runs on a weaker GPT-4o era model with an April 2024 knowledge cutoff, significantly older than OpenAI's latest capabilities. The article highlights a growing gap between OpenAI's consumer voice interface and its more advanced paid models, driven by differences in reward signal clarity and B2B market incentives.