👤Person

Anthropic Mythos

3 articles

First tracked: May 8, 2026

Last updated: May 29, 2026

Latest Coverage

CVE-Bench: testing LLM agents on real-world vulnerability patches

Five frontier LLMs were tested on 20 real CVEs across three prompt types; no model reliably fixes vulnerabilities, with a best 50% solve rate and significant cross-family differences; token cost varies up to ~4x by model, and locate prompts are the hardest test of genuine security reasoning.

May 29, 20261%

RSI is the new AGI — and it’s just as hard to pin down

↗

The piece interrogates RSI (recursive self-improvement) as the next AI frontier, outlining key players, tangible funding/valuation signals, and the regulatory/public-policy debate, while highlighting concrete metrics from the ecosystem and ongoing progress gaps.

May 28, 20261%

Mozilla says 271 vulnerabilities found by Mythos have "almost no false positives"

↗

Mozilla says Mythos identified 271 Firefox vulnerabilities in two months with almost no false positives, aided by a custom harness and model improvements.

May 8, 20261%

Related Entities

📈StockAGI

👤PersonSocher Recursive Superintelligence

🏷️TopicMythos

📈StockRSI

📈StockCVE