Several AI model releases have slipped: GPT-5.6 is now expected in mid-July and DeepMind's 3.5 Pro has been postponed, while OpenAI's Bidi voice model and Claude Sonnet 5 for enterprises are making progress.
OpenAI plans to release GPT-Bidi-1, its next-generation voice model that can listen and speak simultaneously, handle interruptions, and enable more natural conversations.
DramaBox is a highly expressive voice model based on LTX 2.3, released by Resemble AI with open-source code and models on GitHub and Hugging Face.
OpenAI released the GPT-Realtime-2 voice model, featuring GPT-5-level reasoning capabilities and a 128,000 token context window. It supports real-time translation from over 70 input languages to 13 output languages, achieving 96.6% accuracy on the Big Bench Audio Intelligence benchmark. Greg Brockman called it a milestone in voice translation.