OpenAI's Dev Experience Lead recounts how a user prompted Codex to make $15 online: Codex autonomously found open-source projects offering bug bounties, filed issues, and fixed the bugs to earn the money.
Claude agents have gained a new 'Dreaming' feature that enables self-optimization by reviewing past conversations and extracting patterns. Together with multi-agent parallel orchestration and quality assessment, this marks AI agents' transition into a stage of self-evolution.
The open-source Kimi K2.6 model surpasses Opus 4.6 on SWE-Bench and supports autonomous coding sessions of 12+ hours with 4,000+ tool calls.
Shannon is an open-source, AI-powered white-box penetration-testing tool that autonomously analyzes source code and executes real exploits against web apps and APIs to prove vulnerabilities before they reach production.