Tag
PolyRange is a new open-source benchmark for evaluating offensive AI capabilities on web targets, designed to resist contamination by generating fresh tasks per deployment and including active defense tiers.
Dan Jeffries comments on Cloudflare's testing of Anthropic's Mythos, arguing that the real conversation should focus on practical security improvements against AI-powered attacks, and that AI will ultimately make software more secure if teams adapt their workflows.