Tag
GPT-5.5 was used by Epoch to identify fatal errors in approximately one-third of the FrontierMath benchmark problems, demonstrating the model's capability to sanity-check evaluation standards.
A user questions the token efficiency of GPT-5.5 versus GPT-5.4 in Codex, analyzing a chart from Artificial Analysis and praising Cursor's token performance.
A developer announces switching their 16-person engineering team from Anthropic to GitHub Copilot (Codex) and Cursor due to Anthropic's high token costs and the improved efficiency of GPT 5.5.
Satya Nadella announced the integration of GPT-5.5 Instant into M365 Copilot, Copilot Studio, and Foundry, highlighting faster and more accurate responses.
OpenAI announces the rollout of GPT-5.5-Cyber and expands Trusted Access for Cyber (TAC) to provide specialized cybersecurity capabilities to verified defenders while maintaining strict safeguards against misuse.
OpenAI has released GPT-5.5 Instant as the new default model for ChatGPT, offering smarter and more personalized answers.
OpenAI releases the GPT-5.5 Instant system card, marking the first Instant model treated as high capability for cybersecurity and biological/chemical preparedness with corresponding safeguards.
GPT-5.5 sets new state-of-the-art in benchmarks but struggles with hallucination; Kimi K2.6 leads open LLMs; also discusses AI's strain on climate pledges and strategic thinking in LLMs.
OpenAI Codex base instructions for GPT-5.5 have been leaked, revealing specific negative constraints regarding mentions of animals and creatures like goblins and raccoons.
OpenAI and AWS have expanded their partnership to bring OpenAI models, including GPT-5.5, Codex, and Bedrock Managed Agents to Amazon Bedrock. This integration allows enterprises to use OpenAI's frontier capabilities within AWS's existing security and compliance infrastructure.
OpenAI's new GPT-5.5 frontier model now powers Codex, running on NVIDIA GB200 NVL72 systems, and NVIDIA employees are already using it with measurable gains in productivity and debugging speed.
OpenAI releases the system card for GPT-5.5, a new model designed for complex real-world work with enhanced tool use, self-correction, and robust safety safeguards.
OpenAI has launched a Bio Bug Bounty program for GPT-5.5, inviting security researchers to identify universal jailbreaks for biological safety challenges. The program offers rewards up to $25,000 for successfully defeating the model's safeguards on specific bio-risk questions.
OpenAI has released GPT-5.5, its most powerful model to date.
The post cites Greg Brockman saying OpenAI may drop GPT-5.5 this week, calling it our nearest brush with AGI so far, and hinting the model’s self-reinforcing flywheel is already spinning and can’t be stopped.
A social media user claims that GPT-5.5 generated a scarily accurate Excel clone with proper formatting and grid behavior, asserting that the model has effectively solved frontend development.
OpenAI partners with Databricks to release the GPT-5.5 model, achieving a 46% reduction in error rate in agent frameworks, becoming the only model to exceed 50% on benchmarks, with significant improvements in parsing quality and function calling capabilities.