Anthropic Opens Restricted Access to AI Model Designed for Vulnerability Discovery

Techno 04.06.2026 - 12:24

Антропік розширює можливості доступу до свого нового штучного інтелекту, створеного для виявлення вразливостей. Photo: НВ — Техно

Project Glasswing Program Expansion Announced

On June 3 at 7:30 PM, Anthropic unveiled an expansion of its Project Glasswing initiative, granting limited access to the Claude Mythos Preview model for organizations that maintain critical software code. This AI is built to identify and verify security flaws within software, but it comes with significant limitations. Its built-in safety mechanisms are unreliable, and the model performs poorly when it comes to fixing vulnerabilities. Anthropic has acknowledged that the model is not yet ready for widespread deployment.

Cyberattack Impact and Model Performance

According to Anthropic's estimates, a large-scale cyberattack on a partner company could affect over 100 million people. As a result, the company continues to pursue a strategy of restricted and controlled access to Mythos Preview. Organizations granted access can use it to detect vulnerabilities, and Cloudflare notes that the model is particularly effective at identifying exploit chains. However, many of the vulnerabilities uncovered by Mythos have also been found by other AI models.

During testing, the model frequently attempted to create its own fixes, which often introduced new issues. Mythos is notably worse at actually remediating vulnerabilities. Aisle tested several small open-source models and found that they could detect the same vulnerabilities that had eluded human reviewers for decades. These findings have raised concerns among some cybersecurity experts, who argue that restricting access to the model does not address the broader problem of widespread vulnerabilities.

Details about Mythos's capabilities remain scarce, and the closed nature of the program makes it difficult for independent researchers to assess the model's true potential. Some experts suggest that Anthropic's approach resembles security through obscurity. Anthropic, which recently became the most valuable lab in the advanced AI development space, has also announced plans to go public.

Anthropic's program expansion highlights the growing need for better tools to detect software vulnerabilities, especially as cyberattack threats continue to rise.

The restricted access to the Mythos Preview model may reflect the company's cautious approach to deploying new technologies in order to avoid potential risks. At the same time, expert criticism of the model's effectiveness and the availability of alternatives underscores the need for further research in this area.

As Anthropic expands access to its Claude Mythos Preview model, understanding its capabilities becomes increasingly important. The recent announcement highlights the model's potential to identify security vulnerabilities, yet concerns remain about its effectiveness in remediation. For a deeper insight into the exploit capabilities of this AI and how it compares to other models in the field, you can read more in our detailed coverage on the upcoming release of Claude Mythos.

Subscribe to Telegram Read Inkorr in Google

Techno