Tag
This paper audits LangChain, AutoGPT, and OpenAI Agents SDK for architectural safety guarantees and finds no native compliance with containment principles, demonstrating that memory poisoning can cause persistent failures; it introduces lightweight mechanisms to eliminate such attacks.
Anthropic's engineering blog details how they contain Claude agents across products using sandboxing and access controls to cap the blast radius, sharing lessons from deploying Claude Code, Claude Cowork, and claude.ai.
Anthropic discusses how they contain Claude across products by capping blast radius through containment architectures and reducing human supervision fatigue, sharing lessons from deploying Claude.ai, Claude Code, and Claude Cowork.