πŸ‘€Person

Anthropic Mythos

3 articles
First tracked: May 8, 2026
Last updated: May 29, 2026

Latest Coverage

CVE-Bench: testing LLM agents on real-world vulnerability patches

β†—

Five frontier LLMs were tested on 20 real CVEs across three prompt types; no model reliably fixes vulnerabilities, with a best 50% solve rate and significant cross-family differences; token cost varies up to ~4x by model, and locate prompts are the hardest test of genuine security reasoning.

May 29, 20261%

RSI is the new AGI β€” and it’s just as hard to pin down

β†—

The piece interrogates RSI (recursive self-improvement) as the next AI frontier, outlining key players, tangible funding/valuation signals, and the regulatory/public-policy debate, while highlighting concrete metrics from the ecosystem and ongoing progress gaps.

May 28, 20261%

Mozilla says 271 vulnerabilities found by Mythos have "almost no false positives"

β†—

Mozilla says Mythos identified 271 Firefox vulnerabilities in two months with almost no false positives, aided by a custom harness and model improvements.

May 8, 20261%

Related Entities

πŸ“ˆStockAGI
8
πŸ‘€PersonSocher Recursive Superintelligence
1
🏷️TopicMythos
15
πŸ“ˆStockRSI
4
πŸ“ˆStockCVE
7