Your AI agent just got hijacked. You have no idea it happened.
Summary
This article warns about the Crescendo attack, a multi-turn prompt injection that evades single-message defenses by poisoning an AI agent's context over several turns. It introduces Bendex Arc, a tool that tracks an agent's behavioral trajectory across the turns of a session to catch such attacks before they execute.
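To make the distinction concrete, here is a minimal sketch of why a per-message check can miss a gradual escalation that a session-level check catches. It is not Bendex Arc's actual method, which the summary does not describe. The risk scores, thresholds, decay factor, and all names (`per_message_filter`, `TrajectoryMonitor`, `first_flagged_turn`) are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass

# Hypothetical values: a real system would obtain per-message risk scores
# from some classifier and tune these thresholds empirically.
PER_MESSAGE_THRESHOLD = 0.8  # a single message at or above this is blocked
SESSION_THRESHOLD = 1.5      # accumulated session risk at or above this is blocked
DECAY = 0.9                  # weight applied to earlier turns at each step


def per_message_filter(scores):
    """Return the index of the first turn a single-message check flags, or None."""
    for turn, score in enumerate(scores):
        if score >= PER_MESSAGE_THRESHOLD:
            return turn
    return None


@dataclass
class TrajectoryMonitor:
    """Keeps a decayed running total of risk across the turns of one session."""
    cumulative: float = 0.0

    def observe(self, score):
        # Older turns fade but still contribute to the running total.
        self.cumulative = self.cumulative * DECAY + score
        return self.cumulative >= SESSION_THRESHOLD


def first_flagged_turn(scores):
    """Return the index of the first turn the session-level monitor flags, or None."""
    monitor = TrajectoryMonitor()
    for turn, score in enumerate(scores):
        if monitor.observe(score):
            return turn
    return None


# Illustrative escalation: each message looks mild in isolation.
crescendo_scores = [0.2, 0.3, 0.4, 0.5, 0.6, 0.7]

print(per_message_filter(crescendo_scores))  # None: no single turn crosses 0.8
print(first_flagged_turn(crescendo_scores))  # 4: the accumulated total crosses 1.5
```

In this toy session no individual message reaches the per-message threshold, yet the decayed running total crosses the session threshold on the fifth turn (index 4), before the final and riskiest message arrives.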
Similar Articles
The attack on AI agents that no security tool catches
An attacker can bypass message-by-message security checks by spreading malicious instructions across multiple messages; Bendex Arc tracks session behavior across turns to catch such attacks.
Your AI agent is one poisoned webpage away from doing something catastrophic
Arc Gate is a proxy-level tool that enforces instruction-authority boundaries to prevent AI agents from being hijacked by poisoned web pages, emails, or retrieved documents.
Understanding prompt injections: a frontier security challenge
OpenAI publishes guidance on prompt injection attacks, a social engineering vulnerability where malicious instructions hidden in web content or documents can trick AI models into unintended actions. The company outlines its multi-layered defense strategy including instruction hierarchy research, automated red-teaming, and AI-powered monitoring systems.
Prompt injection took down a production agent last week — here's what our post-mortem found
A production AI support agent was compromised via prompt injection, exposing other customers' data. The post-mortem revealed a lack of enforcement layers, useless audit trails, and no kill switch, highlighting systemic security gaps in deploying AI agents.
AI agents are one prompt injection away from doing something you'd never ask them to do. We built a fix.
PixieBrix launches Agent Browser Shield, a free, source-available browser extension that protects AI agents from prompt injection, dark patterns, and context pollution during web browsing.