Build Hour: Agents SDK

YouTube AI Channels Tools

agents-sdk sandbox skills memory containers typescript openai

Summary

OpenAI released a major update to the Agents SDK, introducing a Codex-style runtime framework, separation of sandbox and runtime, memory, skills, and other features. It also launched managed Shell tools, container endpoints, a skills API, and a TypeScript version, aiming to simplify building and deploying production-grade agents.

No content available

Original Article

View Cached Full Text

Cached at: 05/29/26, 07:11 AM

### TL;DR OpenAI released a major update to the Agents SDK, introducing a Codex-style runtime framework, separation of sandbox and runtime framework, memory, skills, and more. It also launched Hosted Shell tools, Container Endpoints, Skills API, and a TypeScript version, aiming to simplify the construction and deployment of production-level agents. --- ## Agents SDK: From Zero to Production-Grade Evolution In the latest Build Hour session, OpenAI’s Christine (Startup Marketing team) and Steve (API team engineer) detailed the core updates to Agents SDK over the past few months. Steve emphasized that the autonomous working capabilities of models are rapidly improving—whether it's Codex coding agents, security scanning agents, or internal data agents, they can remain efficient in long-running tasks. However, deploying agents in production still faces challenges: How to maximize performance while maintaining cross-model flexibility? How to isolate execution environments and manage state? How to achieve a highly customizable framework? The Agents SDK updates are designed to address these pain points. ### Codex-style Runtime Framework: Making Agents Work Like Codex The Agents SDK introduces a "Codex-style" runtime framework, generalizing Codex's advantages in coding scenarios (such as automatic compression, async Shell interaction, task tracking) to any domain. Core elements include: - **Agent Loop**: Automatically handles LLM calls, tool execution, and context updates; developers do not need to build the orchestration layer themselves. - **Web search, file search, MCP, code interpreter, skills** are available out of the box or can be enabled with one click. - **Computer use**: Models can write Shell commands and use an asynchronous interaction loop—they can wait when a command is not yet complete, then return to process, and track running commands in real-time. ### Separation of Sandbox and Runtime Framework: Key to Production Deployment In the past, Codex’s sandbox (e.g., local notebooks) was tied to the runtime framework, causing issues like container death, state loss, and key exposure in production environments. Agents SDK decouples the two: - **Sandbox** becomes a fully ephemeral isolated environment that can expire or be rebuilt at any time, with no need to worry about state persistence. - **Runtime framework** can be deployed on existing infrastructure like Temporal, AWS, etc., handling reloading, snapshots, and multi-tenant session management. - Provides first-class sandbox support: platforms like ETB, Modal, Cloudflare, Vercel, Docker can be integrated. ### New Features: Skills, Memory, Containers - **Skills API**: By uploading skill packs (containing skill MD files and scripts), create a central trusted source. For example, a tax preparation skill defines all IRS rules and document processing scripts. Supports version management, setting default versions, and seamless collaboration with Hosted Shell. - **Agent memory**: Tasks can improve over time, supporting cross-session context retention. - **Containers**: A new concept in the API and SDK for running file operations in isolated environments. --- ## New API Releases: From Lightweight to Production ### Hosted Shell Tool (in Responses API) A "lightweight sandbox" model: with just one API call, upload files, and the model can write code, process files, and return results in a temporary container. Supports automatic mode (auto start/stop container) or manual use of Container Endpoints (create, upload, attach to a response request). ### Container Endpoints and Network Control - Create container, upload files, start container, then attach to the Responses API. - Supports domain whitelisting or fully locking down the container (no inbound/outbound) for enhanced security control. ### TypeScript Version Release Following the Python version in April, the TypeScript version of the sandbox agent functionality is now available, meeting the needs of TypeScript developers. --- ## Live Demo: Agent Task Tracker In the main segment of the session, Steve built an agent task tracker based on Agents SDK live. The demo showed how to quickly implement task assignment, status tracking, file operations, and other functions using the new framework. Since the demo took up most of the livestream, details can be found in the recording. --- ## Q&A and Summary At the end of the livestream, ample time was arranged for Q&A, with special guests answering questions about multi-tenant implementation, memory mechanisms, MCP integration, container resource limits, and more. Christine mentioned that recordings of all past sessions can be found on the OpenAI website, and a replay of this livestream will also be available. --- **Source**: Build Hour: Agents SDK - YouTube (https://www.youtube.com/watch?v=tK32trvj_b4)

Build Hour: Agents SDK

Similar Articles

The next evolution of the Agents SDK

@Saccc_c: https://x.com/Saccc_c/status/2058057029810594206

@axichuhai: https://x.com/axichuhai/status/2062146611472400461

@vista8: This is pretty impressive. Codex has released a Python SDK. Installation command: pip install openai-codex. Integrate it into your own code, essentially embedding a top-tier coding and image generation Agent? Most importantly, you can reuse the Codex login state.

Submit Feedback

Similar Articles

@GitHub_Daily: Using AI agents for production-grade tasks—writing code, running workflows, calling APIs—works fine initially, but as the scale grows, things easily get out of control: permissions too broad, context loss, and debugging becomes impossible. That's where agents-best-practices comes in: a complete guide to designing a runtime framework for AI agents, not limited to coding scenarios, but also applicable to operations, sales...

The next evolution of the Agents SDK

@Saccc_c: https://x.com/Saccc_c/status/2058057029810594206

@axichuhai: https://x.com/axichuhai/status/2062146611472400461

@vista8: This is pretty impressive. Codex has released a Python SDK. Installation command: pip install openai-codex. Integrate it into your own code, essentially embedding a top-tier coding and image generation Agent? Most importantly, you can reuse the Codex login state.