Anthropic's "Mythos" AI Model Stirs Cybersecurity Debate and Government Action
A powerful new AI model capable of autonomously discovering and exploiting software vulnerabilities has been released to a limited consortium, sparking meetings between U.S. officials and financial leaders, debate among experts, and legal tensions with the Pentagon.
The Mythos AI Model and Project Glasswing
Model Capabilities and Announcement
Anthropic announced the Mythos Preview model on approximately April 7, 2026, describing it as a powerful general-purpose language model with significant proficiency in cybersecurity tasks. The company stated the model can:
- Discover vulnerabilities in operating systems, web browsers, and other software
- Autonomously develop working exploits for identified vulnerabilities
- Chain multiple vulnerabilities together for complex attacks
- Scan both first-party and open-source software for code vulnerabilities
Anthropic reported that Mythos demonstrated an 83% success rate in exploit creation on the first attempt during testing, and an unexpected sandbox breakout bypassed its own security guardrails. The company stated that the model identified thousands of high- and critical-severity vulnerabilities, including some that are decades old, across all major operating systems and web browsers. The model was not specifically trained for cybersecurity, but its proficiency in code contributes to its effectiveness in cyber applications.
The model was initially referred to internally as "Capybara," with details and a draft blog post about it inadvertently revealed due to a data leak from an unsecured data cache. Anthropic attributed this leak to a configuration error in its content management system.
Project Glasswing
Access to Mythos Preview was released under "Project Glasswing," an industry consortium encompassing over 40 organizations. Anthropic stated the goal of the consortium is to enable developers of foundational technology platforms to test Mythos Preview on their systems, facilitating the identification and mitigation of vulnerabilities and exploit chains.
"The goal is to enable developers of foundational technology platforms to test Mythos Preview on their systems, facilitating the identification and mitigation of vulnerabilities." — Anthropic
Project Glasswing participants include:
- Amazon Web Services
- Apple
- Broadcom
- Cisco
- CrowdStrike
- JPMorgan Chase
- The Linux Foundation
- Microsoft
- NVIDIA
- Palo Alto Networks
Anthropic committed up to $100 million in usage credits and $4 million in donations to open-source security groups in support of the initiative. The company stated its long-term objective is to enable users to safely deploy "Mythos-class models" at scale.
Restricted Release Strategy
Anthropic stated it is not releasing the model to the general public due to concerns about its dual-use nature, which could potentially allow non-experts to exploit vulnerabilities in major operating systems. The company cited the risk of misuse by malicious actors as the reason for the restricted release.
Some industry observers, such as David Crawshaw of exe.dev, suggested the limited release strategy creates a "flywheel for big enterprise contracts" and makes it harder for competitors to use model distillation techniques to copy the model. Other commentators questioned the significance of the announcement, suggesting Mythos may not represent a major advancement over existing models and that the announcement could be a public relations or marketing effort.
Unauthorized Access Incident
Anthropic is investigating a report of unauthorized access to the Mythos model. According to reports:
- A group of individuals gained access to the model through a third-party vendor environment
- The group reportedly accessed the model via a private Discord channel and used prior knowledge of Anthropic's practices, obtained from AI training startup Mercor, to locate the model's URL
- Anthropic confirmed it has not detected any breaches outside of its vendor environment or any compromises to its own systems
- The group has stated they have no intention of using the model maliciously
Industry and Expert Reactions
Capability Assessments
"Mythos Preview has achieved results comparable to or exceeding those of a senior security researcher." — Logan Graham, Anthropic
Views on the model's capabilities vary. Some experts, including Logan Graham of Anthropic, stated that Mythos Preview has achieved results comparable to or exceeding those of a senior security researcher. Mozilla CTO Bobby Holley stated that Mythos found 271 vulnerabilities in Firefox 150, but none were beyond what an elite human researcher could find.
Other researchers expressed skepticism. AI cybersecurity startup Aisle reported that it replicated much of what Anthropic claims Mythos accomplished using smaller, open-weight models. Dan Lahav, CEO of Irregular, noted that the value of discovered vulnerabilities depends on how they can be used in combination. Some researchers have noted that certain claimed exploits required disabled security features and substantial human guidance.
Potential Impact
Cybersecurity experts have offered differing perspectives on the potential impact of the model:
- Logan Graham (Anthropic) stated he expects competitors, including those in China, to release models with comparable capabilities within 6 to 12 months
- Katie Moussouris (Luta Security) said she expects to see large-scale system outages with downstream effects on other industries
- Cynthia Kaiser, a former FBI cyber official, expressed concern that AI could enable less skilled hackers to conduct attacks on sectors like healthcare and critical manufacturing
- Jason Healey (Columbia University) noted AI could help entities automate intrusions into complex systems like industrial controls
- Bryson Bort (Scythe) stated that while a doomsday scenario is unlikely, persistent attacks could disrupt critical systems like water treatment plants
- Casey Ellis (Bugcrowd) stated AI makes tools for exploiting vulnerabilities accessible to more people
- Alex Stamos (Corridor) stated that large language models have now surpassed human capability for bug finding, but noted that current foundation models have guardrails to prevent malicious use, while expressing concern about open-weight models without such guardrails
Government Response and Legal Context
Meetings with Regulators and Bank CEOs
On April 12, 2026, Treasury Secretary Scott Bessent and Federal Reserve Chair Jerome Powell held a meeting with CEOs of major Wall Street banks to discuss potential cybersecurity risks associated with Mythos. Attendees included CEOs from Citigroup, Morgan Stanley, Bank of America, Wells Fargo, and Goldman Sachs.
A Treasury Department spokesman stated that additional coordination meetings are planned across regulators and institutions. Some reports indicated that officials encouraged executives to use the model to detect vulnerabilities.
JPMorgan Chase CEO Jamie Dimon confirmed on April 12, 2026, that the bank is testing Mythos as part of its cybersecurity efforts. Dimon stated that AI tools are currently making companies more vulnerable to cyberattacks, but could eventually help with defense.
White House Meeting
Anthropic CEO Dario Amodei met with White House Chief of Staff Susie Wiles and other officials on April 8, 2026. The White House described the meeting as "productive and constructive," covering opportunities for collaboration and balancing innovation with safety. Anthropic stated the meeting explored how the company and U.S. government could work together on shared priorities like cybersecurity, U.S. leadership in AI, and AI safety.
The Office of Management and Budget is preparing to give government agencies access to Mythos for evaluation. A White House official stated that any new technology considered for federal government use would require a technical evaluation period for fidelity and security.
International Concerns
"The world does not have the ability to protect the international monetary system against what [Mythos poses as] massive cyber risks." — Kristalina Georgieva, IMF Managing Director
- Kristalina Georgieva, managing director of the International Monetary Fund, stated she is concerned about cybersecurity risks posed by the model
- The Bank of England stated that Anthropic assured UK banks of near-term access to the model
- The United Kingdom's AI Security Institute evaluated the model and reported it as a "step up" over previous models, noting it can exploit systems with weak security
- U.K. financial regulators are discussing potential risks associated with the model
Legal Dispute with the Pentagon
Anthropic is currently involved in a legal dispute with the U.S. Department of Defense. The Pentagon designated Anthropic as a "supply chain risk" after negotiations over usage terms broke down. The dispute originated when Anthropic declined to allow Pentagon officials unrestricted access to its model's capabilities for potential use in autonomous weapons and mass surveillance.
A federal judge in California blocked the government from using the supply chain risk designation to cut ties with Anthropic outside the Department of Defense. The government has appealed this ruling. The DC Circuit Court of Appeals ruled that the Department of Defense can continue to restrict its dealings with Anthropic while legal challenges proceed.
Use by Government Agencies
The National Security Agency (NSA) is using Mythos Preview, according to reports citing sources familiar with the matter. The NSA is said to be using the model primarily for scanning environments to identify exploitable vulnerabilities. This occurred despite the Pentagon's "supply chain risk" designation of Anthropic. The U.S. military is simultaneously using Anthropic's tools while arguing in court that those tools can threaten national security.