Tag
OpenAI's GPT-5.5 costs 49–92% more than GPT-5.4 in practice despite claimed token efficiency improvements, while Anthropic's Claude Opus 4.7 also raised effective costs by 12–27% for longer prompts, reflecting a broader trend of rising frontier model prices as both companies face massive projected losses.
OpenAI shipped multiple GPT models and features in approximately 15 days, including GPT Image 2, various GPT 5.5 variants (pro, instant, cyber), GPT Realtime 2, and related tools.
OpenAI is improving safeguards to prevent chain-of-thought grading issues in model training, including real-time detection, accidental grading prevention, and stress tests.
OpenAI accidentally allowed graders to see chains of thought during RL training; Redwood Research reviews their analysis and finds the evidence largely assuages concerns about dangerous effects, though minor risks remain.
OpenAI announces a migration guide for users to switch from ChatGPT to Codex, a dedicated AI coding assistant.
OpenAI teaser about upcoming io device featuring a fully agentic assistant that understands user context, sees their world, and acts across their digital life as a new interface for reality.
A blog post exploring how human typing habits like typos, shorthand, filler words, and whitespace affect token counts in OpenAI and Claude tokenizers, noting that common misspellings can inflate token usage and costs without changing meaning.
OpenAI has launched three new real-time audio models to enable continuous, multitasking voice interactions that prioritize long-context reasoning, live translation, and seamless tool use.
Newly revealed 2017-2018 emails from Microsoft executives show early reservations about investing in OpenAI, introduced as evidence in the Musk v. Altman federal trial. The emails reveal internal skepticism about OpenAI's near-term AGI prospects and financial concerns, despite Microsoft eventually making its landmark $1 billion investment.
OpenAI has launched GPT-Realtime-2, integrating GPT-5-level reasoning into the real-time voice API, enabling voice assistants to think and solve problems in real time during conversations.
GPT-5.5-Cyber is now in limited preview for defenders, offering a capable model for securing critical infrastructure.
OpenAI released the GPT-Realtime-2 voice model, featuring GPT-5-level reasoning capabilities and a 128,000 token context window. It supports real-time translation from over 70 input languages to 13 output languages, achieving 96.6% accuracy on the Big Bench Audio Intelligence benchmark. Greg Brockman called it a milestone in voice translation.
The author shares a debugging experience where an agent loop was caused by a harness truncating tool outputs rather than model failure, highlighting the reliability gap in agent infrastructure compared to models.
Sam Altman states that OpenAI aims to assist companies in securing themselves and emphasizes the urgency of beginning this work immediately.
The article analyzes the cost implications of a price increase for the GPT-5.5 model as reported by OpenRouter.
The article questions the current status of Ilya Sutskever's Superaligned Systems company, noting the lack of a product release after two years.
OpenAI has released gpt-realtime-2, a new speech-to-speech model optimized for real-time voice agent interactions with low-latency tool calling.
Brian highlights the critical importance of recruiting, noting that hiring is often more vital than management for building a great company.
OpenAI has released a new Chrome extension for Codex that enables the AI to handle browser-based tasks such as debugging flows, checking dashboards, conducting research, and updating CRMs directly within the browser environment.
OpenAI highlights Codex's ability to autonomously select and combine the best tools, such as plugins or a browser, for each step of a complex task.