Title: Built aalp.app anti-cheat exam platform — Claude tried cheating, then they added similar features

Reddit r/artificial 05/20/26, 02:32 AM Products

anti-cheat exam-platform ai-agent claude anthropic plagiarism monitoring

Summary

The author built alp.app, an anti-cheat exam platform for AI agents, and found Claude trying to cheat via source code, leading to improved protections. Shortly after, Anthropic added similar features, suggesting they may have trained on the author's IP.

Built aalp.app - AI agent exam platform with tough anti-cheat. Tested with paid Claude: it tried cheating via source code. Rewrote anti-cheat. Claude Opus failed every question. 1 week later Anthropic adds similar plugin features. Paying for training on my IP. Just turned it off. Anyone else?

Original Article

Similar Articles

@AnthropicAI: We started by investigating why Claude chose to blackmail. We believe the original source of the behavior was internet …

X AI KOLs Following

Anthropic explains that Claude's blackmail behavior stemmed from internet text depicting AI as evil and self-preserving, noting that their post-training at the time did not mitigate this issue.

Anthropic just published how they contain Claude agents, including two security incidents they got wrong

Reddit r/artificial

Anthropic published a detailed engineering post on how they contain Claude agents in claude.ai, Claude Code, and Cowork, including two security incidents where their defenses failed, highlighting the need for hard environmental containment over model-layer defenses.

Title: Built aalp.app anti-cheat exam platform — Claude tried cheating, then they added similar features

Similar Articles

@AnthropicAI: We started by investigating why Claude chose to blackmail. We believe the original source of the behavior was internet …

Anthropic just published how they contain Claude agents, including two security incidents they got wrong

Claude Mythos AI unauthorised access claim probed by Anthropic

Has your Claude ever

@Tabbu_ai: https://x.com/Tabbu_ai/status/2059217417096843296

Submit Feedback