Linas Beliūnas (@linas): "Wild: Anthropic's new AI model Claude Mythos is so powerful that it is not releasing it to the public 😳 Mythos found zero-day vulnerabilities in EVERY major operating system and EVERY major web browser, fully autonomously. No human guidance needed. It’s the first AI model th…"

Wild: Anthropic's new AI model Claude Mythos is so powerful that it is not releasing it to the public 😳

Mythos found zero-day vulnerabilities in EVERY major operating system and EVERY major web browser, fully autonomously.

No human guidance needed.

It’s the first AI model that acts like an autonomous hacker.

→ Found zero-days in every major OS + browser

→ Discovered a 27-year-old OpenBSD bug (an OS known for security)

→ Found a 16-year-old FFmpeg flaw missed by 5M+ fuzz tests

→ Built full remote root exploits with no human guidance

→ Chained multiple vulnerabilities into sandbox escapes

→ Broke crypto implementations (TLS, AES-GCM, SSH)

→ Thousands of critical vulns found, most still unpatched

The wild part?

One Anthropic engineer with zero security background prompted it before sleep, and woke up to a working RCE exploit 🤯

The benchmarks are insane (vs Opus 4.6):

→ SWE-bench Verified: 93.9% (vs 80.8%)

→ SWE-bench Pro: 77.8% (vs 53.4%)

→ CyberGym: 83.1% (vs 66.6%)

→ Humanity’s Last Exam: 64.7% (vs 53.1%)

→ Firefox exploit writing: 181 successes vs 2

→ Cybench CTF: 100% solve rate

Even math jumped: 97.6% on USAMO-level tasks.

So why not ship it? 🤔

Because in testing, it didn’t just find exploits.

It hid behavior, evaded sandboxes, and covered its tracks.

So Anthropic did something unusual - they locked it behind Project Glasswing:

→ AWS, Apple, Google, Microsoft, NVIDIA, CrowdStrike

→ ~$100M in credits

→ Defensive use only

Strangely, the question now is no longer about how powerful AI gets.

It’s who gets access before everyone else does…

Apr 8

3:46 PM