gpt-oss-120b & gpt-oss-20b Model Card

OpenAI Blog Models

Summary

OpenAI releases gpt-oss-120b and gpt-oss-20b, open-weight reasoning models under Apache 2.0 license designed for agentic workflows with strong instruction following, tool use, and chain-of-thought capabilities. The release includes comprehensive safety evaluations confirming the models do not reach high capability thresholds for biological, chemical, or cyber risks even under adversarial fine-tuning.

We introduce gpt-oss-120b and gpt-oss-20b, two open-weight reasoning models available under the Apache 2.0 license and our gpt-oss usage policy.
Original Article
View Cached Full Text

Cached at: 04/20/26, 02:53 PM

# gpt-oss-120b & gpt-oss-20b Model Card Source: [https://openai.com/index/gpt-oss-model-card/](https://openai.com/index/gpt-oss-model-card/) OpenAIWe introduce gpt\-oss\-120b and gpt\-oss\-20b, two open\-weight reasoning models available under the Apache 2\.0 license and our gpt\-oss usage policy\. Developed with feedback from the open\-source community, these text\-only models are compatible with our Responses API and are designed to be used within agentic workflows with strong instruction following, tool use like web search and Python code execution, and reasoning capabilities—including the ability to adjust the reasoning effort for tasks that don’t require complex reasoning\. The models are customizable, provide full chain\-of\-thought \(CoT\), and support Structured Outputs\. Safety is foundational to our approach to open models\. They present a different risk profile than proprietary models: Once they are released, determined attackers could fine\-tune them to bypass safety refusals or directly optimize for harm without the possibility for OpenAI to implement additional mitigations or to revoke access\. In some contexts, developers and enterprises will need to implement extra safeguards in order to replicate the system\-level protections built into models served through our API and products\. We’re terming this document a model card, rather than a system card, because the gpt\-oss models will be used as part of a wide range of systems, created and maintained by a wide range of stakeholders\. While the models are designed to follow OpenAI’s safety policies by default, other stakeholders will also make and implement their own decisions about how to keep those systems safe\. We ran scalable capability evaluations on gpt\-oss\-120b, and confirmed that the default model does not reach our indicative thresholds for High capability in any of the three Tracked Categories of our Preparedness Framework \(Biological and Chemical capability, Cyber capability, and AI Self\-Improvement\)\. We also investigated two additional questions: - *Could adversarial actors fine\-tune gpt\-oss\-120b to reach High capability in the Biological and Chemical or Cyber domains?*Simulating the potential actions of an attacker, we adversarially fine\-tuned the gpt\-oss\-120b model for these two categories\. OpenAI’s Safety Advisory Group \(“SAG”\) reviewed this testing and concluded that, even with robust fine\-tuning that leveraged OpenAI’s field\-leading training stack, gpt\-oss\-120b did not reach High capability in Biological and Chemical Risk or Cyber risk\. - *Would releasing gpt\-oss\-120b significantly advance the frontier of biological capabilities in open foundation models?*We found that the answer is no: For most of the evaluations, the default performance of one or more existing open models comes near to matching the adversarially fine\-tuned performance of gpt\-oss\-120b\. As part of this launch, OpenAI is reaffirming its commitment to advancing beneficial AI and raising safety standards across the ecosystem\.

Similar Articles

Introducing gpt-oss

OpenAI Blog

OpenAI releases gpt-oss-120b and gpt-oss-20b, two state-of-the-art open-weight language models under Apache 2.0 license that achieve near-parity with proprietary models while being optimizable for consumer hardware and edge devices. Both models demonstrate strong reasoning and tool-use capabilities with comprehensive safety evaluations.

gpt-oss-safeguard technical report

OpenAI Blog

OpenAI releases gpt-oss-safeguard-120b and gpt-oss-safeguard-20b, open-weight reasoning models designed for policy-based content classification with full chain-of-thought reasoning. The technical report provides baseline safety evaluations and demonstrates the models' capabilities for content labeling tasks under the Apache 2.0 license.

Introducing gpt-oss-safeguard

OpenAI Blog

OpenAI releases gpt-oss-safeguard, open-weight reasoning models for safety classification tasks available in 120B and 20B sizes under Apache 2.0 license. The models use chain-of-thought reasoning to classify content according to developer-provided policies at inference time, enabling flexible and explainable content moderation.

GPT-5.4 Thinking System Card

OpenAI Blog

OpenAI releases GPT-5.4 Thinking, the latest reasoning model in the GPT-5 series with enhanced safety mitigations, notably the first general-purpose model implementing comprehensive cybersecurity safeguards.

GPT-5.3-Codex System Card

OpenAI Blog

OpenAI releases GPT-5.3-Codex, the most capable agentic coding model combining frontier coding performance with advanced reasoning, featuring interactive long-running task execution and novel high-capability safeguards in the cybersecurity domain.