Anthropic’s Project Glasswing: The Dangerous AI Cyber Model Too Powerful to Release

Anthropic has announced Project Glasswing, a sweeping cybersecurity initiative that pairs an unreleased frontier AI model—Claude Mythos Preview—with a coalition of twelve major technology and finance companies. The goal: find and patch software vulnerabilities across the world’s most critical infrastructure before adversaries can exploit them.

The launch partners include Amazon Web Services, Apple, Broadcom, Cisco, CrowdStrike, Google, JPMorganChase, the Linux Foundation, Microsoft, Nvidia, and Palo Alto Networks. The coalition has extended access to more than 40 additional organizations that build or maintain critical software, with Anthropic committing up to $100 million in usage credits for Claude Mythos Preview.

**Why This Model Can’t Be Released**

At the center of Project Glasswing sits Claude Mythos Preview, a general-purpose frontier model that Anthropic says has already identified thousands of high-severity zero-day vulnerabilities in every major operating system and every major web browser.

“We do not plan to make Claude Mythos Preview generally available due to its cybersecurity capabilities,” Newton Cheng, Frontier Red Team Cyber Lead at Anthropic, told VentureBeat. “However, given the rate of AI progress, it will not be long before such capabilities proliferate, potentially beyond actors who are committed to deploying them safely. The fallout—for economies, public safety, and national security—could be severe.”

**Extraordinary Results: 27-Year-Old Vulnerabilities Found**

The technical results are remarkable. According to Anthropic’s press release, Mythos Preview was able to find nearly all vulnerabilities it surfaced and develop many related exploits entirely autonomously, without human steering.

Three examples stand out:

• **OpenBSD Vulnerability**: The model found a 27-year-old vulnerability in OpenBSD—widely regarded as one of the most security-hardened operating systems in the world, commonly used to run firewalls and critical infrastructure. The flaw allowed an attacker to remotely crash any machine running the OS simply by connecting to it.

• **FFmpeg Flaw**: A 16-year-old vulnerability in FFmpeg—the near-ubiquitous video encoding and decoding library—was discovered in a line of code that automated testing tools had exercised five million times without ever catching the problem.

• **Linux Kernel Exploit**: Mythos Preview autonomously found and chained together several vulnerabilities in the Linux kernel to escalate from ordinary user access to complete control of the machine.

All three vulnerabilities have been reported to the relevant maintainers and patched.

**Performance Benchmarks: A Significant Leap**

On the CyberGym evaluation benchmark, Mythos Preview scored 83.1%, compared to 66.6% for Claude Opus 4.6, Anthropic’s next-best model. The gap is even more pronounced on coding benchmarks: Mythos Preview achieves 93.9% on SWE-bench Verified versus 80.8% for Opus 4.6, and 77.8% on SWE-bench Pro versus 53.4%.

**Responsible Disclosure at Scale**

Finding thousands of zero-days at once raises a critical question: how does Anthropic plan to handle disclosure without overwhelming open-source maintainers, many of whom are unpaid volunteers? Cheng outlined a triage pipeline where the highest severity bugs go to professional human triagers who validate every report before sending to maintainers.

The company follows a coordinated vulnerability disclosure framework. Once a patch is available, it generally waits 45 days before publishing full technical details, giving downstream users time to deploy the fix.

**What Partners Are Saying**

Despite recent security lapses, major partners are moving forward. CrowdStrike’s CTO Elia Zaitsev framed the initiative in terms of collapsing timelines: “The window between a vulnerability being discovered and being exploited by an adversary has collapsed—what once took months now happens in minutes with AI.”

AWS Vice President and CISO Amy Herzog said her teams have already been testing Mythos Preview against critical codebases. Jim Zemlin, CEO of the Linux Foundation, pointed to the fundamental asymmetry that has plagued open-source security for decades.

Anthropic has donated $2.5 million to Alpha-Omega and OpenSSF through the Linux Foundation, and $1.5 million to the Apache Software Foundation.

**Financial Momentum**

The announcement arrives alongside significant financial momentum for Anthropic. The company disclosed that its annualized revenue run rate has surpassed $30 billion, up from approximately $9 billion at the end of 2025, with more than 1,000 business customers each spending over $1 million annually—doubling in less than two months.

Project Glasswing represents Anthropic’s most ambitious attempt to translate frontier AI capabilities—capabilities the company itself describes as dangerous—into a defensive advantage before those same capabilities proliferate to hostile actors.

Related Posts

Newsletter

Join the discussion Cancel reply