Our evaluation of OpenAI's GPT-5.5 cyber capabilities
Summary
Simon Willison evaluates the cyber capabilities of OpenAI's GPT-5.5, examining how the model performs on cybersecurity tasks.
Cached at: 05/08/26, 06:57 AM
Similar Articles
@sama: We want to help all companies be secure, working with the USG and the security ecosystem. *The full version of GPT-5.5-…
OpenAI releases the full version of GPT-5.5-Cyber, a cybersecurity-focused AI model with state-of-the-art performance on CyberGym, and announces efforts to improve security through Patch The Planet and Codex Security.
Strengthening cyber resilience as AI capabilities advance
OpenAI publishes a comprehensive framework for managing cyber capabilities in AI models, noting significant improvements in CTF performance from GPT-5 (27%) to GPT-5.1-Codex-Max (76%), and outlining defense-in-depth safeguards to ensure advanced models primarily benefit defenders while limiting offensive misuse.
OpenAI launches new security tools and updates GPT-5.5-Cyber (2 minute read)
OpenAI launches new security tools, including the Codex Security plugin and an updated GPT-5.5-Cyber model, alongside the Daybreak initiative and the Patch the Planet open-source project, shifting its focus from vulnerability discovery to automated patch generation.
An updated GPT-5.5 Cyber outperforms Mythos 5 in CyberGym
An updated GPT-5.5 Cyber model surpasses Mythos 5 in the CyberGym benchmark.
Scaling Trusted Access for Cyber with GPT-5.5 and GPT-5.5-Cyber
OpenAI announces the rollout of GPT-5.5-Cyber and expands Trusted Access for Cyber (TAC) to provide specialized cybersecurity capabilities to verified defenders while maintaining strict safeguards against misuse.