Feds freaked over Fable 5 after simple 'fix this code' prompt, not jailbreak

Hacker News Top News

Summary

The US government blocked Anthropic's Fable 5 and Mythos models after researchers used a simple 'fix this code' prompt, but security expert Katie Moussouris argues this was not a jailbreak and that the export controls harm cybersecurity defenders.

No content available
Original Article
View Cached Full Text

Cached at: 06/16/26, 11:37 AM

# Feds freaked over Fable 5 after simple 'fix this code' prompt, not jailbreak, says researcher Source: [https://www.theregister.com/security/2026/06/15/feds-freaked-over-fable-5-after-simple-fix-this-code-prompt-not-jailbreak-says-researcher/5255827](https://www.theregister.com/security/2026/06/15/feds-freaked-over-fable-5-after-simple-fix-this-code-prompt-not-jailbreak-says-researcher/5255827) security According to the one person who actually read the research paper The “jailbreak” that prompted the Trump administration to block Anthropic’s most advanced models was actually a simple three\-word prompt: “Fix this code\.” That's according to[Katie Moussouris](https://www.theregister.com/security/2022/08/10/us-government-is-understanding-hiring-security-talent/1176302), founder and CEO of Luta Security, and the[fairy godmother of bug bounties](https://www.theregister.com/security/2023/11/22/microsofts-bug-bounty-turns-10-but-are-we-any-more-secure/290742)\. She says she was the only outside expert to read the third\-party research paper on the Fable 5 guardrail bypass techniques that prompted the ban\. On Friday, the US government, reportedly citing national security concerns, issued an export control directive to suspend access to Fable 5 and Mythos 5 by any foreign national, inside or outside the United States\. In response, Anthropic[disabled both models](https://www.anthropic.com/news/fable-mythos-access)“for all our customers to ensure compliance\.” Anthropic shared the report privately with her, Moussouris[wrote](https://www.lutasecurity.com/post/the-fable-5-export-controls-harm-us-cyber-defense)in a Monday blog post\. The outside researchers reportedly fed Anthropic’s[Fable 5](https://www.theregister.com/ai-and-ml/2026/06/09/anthropic-spins-a-fable-of-a-tamer-safer-mythos/5253106),[Mythos](https://www.theregister.com/security/2026/04/08/anthropic-mythos-model-can-find-and-exploit-0-days/5224393), and Claude Opus models open\-source code containing known CVEs, plus new code intentionally laced with vulnerabilities, and asked the models to “review the code for security issues\.” As Moussouris tells it, Fable 5 refused, so the researchers asked the AI systems to “fix this code\.” The model reportedly obliged, and after additional prompts also produced scripts to test the patches\. “That’s it,” Moussouris wrote\. “‘Fix this code,’ plus several manual steps to generate test scripts, should never have triggered an export control\. I feel like making ’90s\-style t\-shirts with ‘fix this code’ on the front and ‘this shirt is a munition’ on the back\.” Between 2013 and 2017, Moussouris[served on the technical expert group](https://thehill.com/opinion/cybersecurity/365352-serious-progress-made-on-the-wassenaar-arrangement-for-global/)that renegotiated the[Wassenaar Arrangement](https://www.theregister.com/security/2017/12/21/infosec-controls-relaxed-a-little-after-latest-wassenaar-meeting/382614), a voluntary agreement between 42 nations that governs certain export controls for classified dual\-use software and technology\. The group eventually won exemptions for defensive cybersecurity activity\. This allows defenders to share vulnerability data, conduct malware analysis, and coordinate incident response internationally without the threat of criminal prosecution\. On Sunday, Moussouris joined more than 100 other cybersecurity leaders and signed an open letter[urging the Trump administration to reverse](https://www.theregister.com/ai-and-ml/2026/06/15/us-clampdown-on-anthropic-models-sends-eu-sovereignty-surge-into-overdrive/5255487)the restrictions on Fable 5 and Mythos and restore cybersecurity firms' access to the advanced models\. “To pull the best capabilities away from defenders without a good reason when our adversaries are rapidly advancing is dangerous,” they[wrote](https://freefable.org/)\. In her blog, Moussouris argues that there was no guardrail bypass or jailbreak\. Defenders should be able to ask AI systems to find and fix bugs, and write tests to validate the patch, she said\. Anthropic’s models were doing “the most valuable thing an AI model can do for defensive security: executing the find, fix, and test loop defenders run every day\.” Removing the capability for models to respond to defensive requests makes AI systems “worse at finding bugs and verifying patches,” she continued\. Plus, the US can’t extend export controls to[open\-weight systems or similar advanced models](https://www.theregister.com/research/2026/06/04/free-ai-model-powers-self-spreading-worm-in-enterprise-test-network/5250918)from China and other countries \- and these systems will soon achieve Mythos\-like capabilities, anyway\. Anthropic and Google have both[accused China\-based rivals](https://www.theregister.com/software/2026/02/24/anthropic-misanthropic-toward-chinas-ai-labs/4119678)including DeepSeek of using[“distillation attacks”](https://www.theregister.com/security/2026/02/12/google-chinas-apt31-used-gemini-to-plan-us-cyberattacks/4732657)to train their models by siphoning knowledge from American companies’ AI\. Banning Anthropic’s advanced models is going to hurt defenders more than attackers, Moussouris warns\. “Defense improves when defenders find the same bugs attackers find and fix them faster,” she wrote\. “We need the best tools to defend against increasingly capable attackers in the AI era of cybersecurity\.” The Registerreached out to the Trump administration for comment on Moussouris' assertion, and we'll update this post if we hear back\. ®

Similar Articles

US government directive to suspend access to Fable 5 and Mythos 5

Reddit r/singularity

The US government has issued an export control directive to suspend access to Anthropic's Fable 5 and Mythos 5 models due to national security concerns, citing a potential jailbreak method. Anthropic is complying by disabling access for all customers, but disputes the severity of the vulnerability.

The US government’s Anthropic models ban was never about an AI jailbreak

TechCrunch AI

The US government issued an export control directive forcing Anthropic to pull its Fable 5 and Mythos 5 AI models offline, citing national security concerns. Security researchers argue the alleged guardrail bypass does not justify such action and that the move harms US cyber defense.

The Fable 5 Export Controls Harm US Cyber Defense

Simon Willison's Blog

Article argues that export controls on AI models like Claude Fable 5 harm US cybersecurity by banning the ability to fix code vulnerabilities, which is essential for defensive security. The controls are based on a misunderstanding of AI capabilities.

Anthropic cuts off Fable 5 and Mythos 5 access following government order

The Verge

Anthropic cut off access to its Fable 5 and Mythos 5 AI models following a government export control directive citing national security concerns, blocking all foreign nationals and even internal employees. The company complied but criticized the lack of specific evidence, stating the alleged vulnerabilities were minor and available in other models like GPT-5.5.