AI News: The Model That Has Everyone Freaked Out!


Summary

Anthropic’s unreleased Claude Mythos model demonstrates elite-level hacking and vulnerability discovery, prompting a private preview with major tech firms to patch software before public release.


Cached at: 04/21/26, 03:35 PM

TL;DR: Anthropic’s unreleased Claude Mythos model is so good at hacking that the company is keeping it private and giving only select security teams early access to patch software before bad actors catch up.

## Claude Mythos & Project Glasswing

### What is Claude Mythos?

Anthropic’s internal codename for its “final frontier” model. According to the 245-page system card:

> “Claude Mythos is an unreleased general-purpose frontier model that reveals a stark reality: AI coding capabilities have advanced to the point where the model can discover and exploit software vulnerabilities at a level only matched by elite human hackers.”

Benchmark snapshots:

- **Cyber Security Vulnerability Reproduction**: Opus 4.6 66.6% → **Mythos 83.1%**
- **SWE-bench Pro**: +24 percentage points over Opus 4.6
- **Terminal-Bench**: +17 percentage points over Opus 4.6
- **SWE-bench Multimodal**: roughly **double** the Opus 4.6 score

The model:

- Found a **27-year-old vulnerability** in OpenBSD
- Unearthed a **16-year-old bug** in FFmpeg
- Chained multiple flaws in the Linux kernel, the software that powers most of the world’s servers

Anthropic’s verdict: “These capabilities could soon proliferate to actors who may not act responsibly, with potentially severe economic, public-safety, and national-security implications.”

### Project Glasswing

Instead of releasing Mythos, Anthropic is running a private preview for security teams at Apple, Microsoft, NVIDIA, Cisco, CrowdStrike and others. The goal: let big tech patch their products before equivalent models appear in the wild.

Key quote from Anthropic’s video: “We did not specifically train it on cybersecurity; we simply trained it to be excellent at code, and a side effect is elite-level hacking.”

### Déjà vu or genuine caution?

2019’s GPT-2 “too dangerous to release” memo looked like hype in hindsight. This time the caution feels real: Anthropic is handing the skeleton key to competitors first, asking them to lock their own doors.

Responsible AI move or not, users win if iPhones, Windows boxes and Chrome installs get hardened before the next worm drops.

## Two New Public Models Worth Your Time

### Meta Muse Spark

- First release from Meta’s new **Superintelligence Labs** (Alexandr Wang of Scale AI now steering the ship; Yann LeCun in an advisory role)
- **Closed-source**: Meta’s open-source crusade is on pause
- Benchmarks show solid all-around performance and **best-in-class token efficiency**, i.e., cheap inference
- Live inside [Meta AI](https://ai.meta.com); API access is wait-list only

### Zhipu GLM-5.1

- Dropped under the **MIT license**; weights on [Hugging Face](https://huggingface.co/THUDM/glm-5.1-32b)
- **SWE-bench Pro score: 58.4%**, edging out GPT-5.4 (57.7%) and Claude Opus 4.6 (57.3%)
- Runs locally, is fine-tunable, and marks the first time an **open-source model** hits top-tier coding performance
- Flying under the radar: if you want open heft comparable to the best closed labs, this is your drop

## Quick-fire Round

- **Gemini** now generates **3-D models** and launched **NotebookLM-style “Gemini Notebooks”**
- **Runway** pushed **Seedance 2.0** for longer, smoother AI dance videos
- **CapCut** added generative AI stickers and one-click style transfer
- **HeyGen** released **Forever Avatar V**, an avatar you train once and keep forever
- **OpenAI** quietly added a **$200 ChatGPT Pro** tier with higher rate limits
- **Claude** introduced **managed agents** for enterprise accounts

Bottom line: the models that can break your code are staying locked up for now, while cheaper, open-weight challengers are arriving fast. Patch early, test freely, and keep an eye on the next drop.

Source: [Matt Wolfe on YouTube](https://www.youtube.com/watch?v=SguncMvE77I)
