Tag
OpenAI announces security hardening of ChatGPT Atlas against prompt injection attacks through adversarial training and strengthened safeguards, including a rapid response loop for discovering and mitigating novel attack strategies before they appear in the wild.
Anthropic discusses how they contain Claude across products by capping blast radius through containment architectures and reducing human supervision fatigue, sharing lessons from deploying Claude.ai, Claude Code, and Claude Cowork.