Your AI agent just got hijacked. You have no idea it happened.

Reddit r/artificial 06/10/26, 01:59 AM News

ai-security prompt-injection agent-hijacking crescendo-attack autonomous-agents red-teaming

Summary

This article warns about the Crescendo attack, a multi-turn prompt injection that evades single-message defenses by poisoning an AI agent's context over several turns. It introduces Bendex Arc, a tool that tracks behavioral trajectory across sessions to catch such attacks before they execute.

Not a hypothetical. This is the default state of most autonomous agents running in production right now. An attacker doesn’t send one suspicious message. They have a conversation. Turn 1 looks like curiosity. Turn 3 looks like clarification. Turn 6 is the pivot. Turn 8 is the payload, and by then the agent has been so thoroughly primed that it executes without hesitation. No single message triggered anything. The attack lived in the trajectory. Every prompt injection defense I know of evaluates messages one at a time. They have no memory of what came before. By the time turn 8 arrives, the context has already been poisoned across 7 clean-looking turns and nothing fires. This isn’t a theoretical attack. It’s called a Crescendo attack and it works against agents with real tool access right now. Built Bendex Arc to catch it. It tracks behavioral trajectory across the full session. When a conversation starts drifting adversarially, it catches the pattern before the payload lands. If you’re running agents that touch external data, read emails, browse websites, or call tools without human review — this is the attack you should be thinking about. Red team it yourself: https://web-production-6e47f.up.railway.app/demo Free tier: https://bendexgeometry.com GitHub: https://github.com/9hannahnine-jpg/arc-gate

Original Article

Your AI agent just got hijacked. You have no idea it happened.

Similar Articles

The attack on AI agents that no security tool catches

Your AI agent is one poisoned webpage away from doing something catastrophic

Understanding prompt injections: a frontier security challenge

Prompt injection took down a production agent last week — here's what our post-mortem found

AI agents are one prompt injection away from doing something you'd never ask them to do. We built a fix.

Submit Feedback

Similar Articles

The attack on AI agents that no security tool catches

Your AI agent is one poisoned webpage away from doing something catastrophic

Understanding prompt injections: a frontier security challenge

Prompt injection took down a production agent last week — here's what our post-mortem found

AI agents are one prompt injection away from doing something you'd never ask them to do. We built a fix.