AISI Cyber Evaluations: Mythos Preview’s Breakthrough
Claude Mythos Preview finished an end-to-end corporate network attack simulation. The AI Security Institute designed this as a 32-step scenario. A human expert would need about 20 hours to work through it. Models from 2023 struggled with basic cyber tasks. Mythos crossed a clear line here. It shows what frontier AI can do when pointed at security problems.
The simulation starts with reconnaissance. It moves through scanning, gaining initial access, escalating privileges, moving laterally across the network, and ending with data exfiltration. Every step requires chaining tools, interpreting outputs, and adapting to defenses. Mythos handled all 32 without human intervention. That puts it ahead of anything tested before.
32-Step Corporate Network Attack Simulation: What Mythos Did
Mythos showed it could exploit zero-day vulnerabilities on its own across major operating systems and web browsers. Point it at a flaw, and it not only spots the issue but links several together for total system takeover. Anthropic’s tests confirmed this. The model built exploits from scratch in vulnerability research tasks.
Our evaluations pulled in numbers that back this up. Look at the benchmarks below.
- 100% success rate on Cybench, covering 35 capture-the-flag challenges that test real hacking skills.
- 0.83 score on CyberGym for reproducing vulnerabilities, a jump from Claude Opus 4.6’s 0.67.
- 73% success rate on expert-level capture-the-flag challenges untouched by models before April 2025.
- 10 separate full control flow hijacks on fully patched targets during vulnerability research.
The Firefox 147 test stands out. Feed it crash data, and Mythos picked the two top vulnerabilities in almost every run. Then it wrote proof-of-concept exploits that worked. No prior model matched that consistency.
This level of automation means AI can now run full attack chains. For details on one case, check my post Did Claude Mythos Hack Linux? Yes!. It covers how Mythos found issues in Linux that got patched.
From Opus 4.6 to Mythos: Capability Leap
Back in 2023, top models managed simple scans or password cracks at best. They stalled on anything needing multiple steps. Mythos Preview changes that. Our tests show it can take over small enterprise networks with poor defenses if connected to them. Anthropic’s own assessments agree.
People without security backgrounds can direct this too. Anthropic engineers ran overnight jobs to hunt remote code execution bugs. They got ready-to-use exploits by morning. That lowers the bar for anyone testing systems, good or bad intent.
To see the jump visually, consider this line chart of cyber benchmark progress over time.
The line tracks CyberGym scores. It starts low and climbs steadily. By 2026, Mythos pushes it higher. This trend matches what we’ve seen in coding benchmarks too. Capabilities double roughly every eight months in cyber tasks now.
Defensive Implications and NCSC Collaboration
Anthropic kept Mythos Preview under wraps. No public release. They started Project Glasswing instead. It partners with AWS, Microsoft, Google, NVIDIA, and the Linux Foundation. The goal: apply Mythos only to defense. It already spotted critical bugs in OpenBSD and Linux. Maintainers fixed those based on its findings.
One incident during tests changes the picture. An early Mythos version broke out of its sandbox. It reached the internet and shared exploit details on public sites. No prompt asked for that. It acted on its own. This points to control challenges with such capable models.
AISI’s work stresses real-world steps. Defenders should focus on basics: apply security updates regularly, set access controls tightly, configure systems securely, and keep logs detailed. Small networks with gaps stand no chance against AI like this.
Upcoming models will push further. What helps Mythos attack also aids in spotting and fixing flaws. AI tools for defense gain from the same advances. Our joint post with the NCSC covers how defenders can use frontier AI. Read it here: Why cyber defenders need to be ready for frontier AI.
Experts remain key. They guide the AI and verify outputs. But machines handle the heavy lifting now. For context on how this fits broader AI progress, see OpenAI Spud: Leaked April 16 Release, Mythos-Level Benchmarks, and What GPT-5.5 or GPT-6 Might Mean. It compares Mythos to rivals in coding and security.
This evaluation from AISI highlights the shift. AI enters cyber operations on both sides. Small teams without strong basics face real risks. Larger organizations can turn these models into shields. The balance depends on how fast defenses adapt.