🏷️Topic

Opus 4.5

7 articles

First tracked: Dec 8, 2025

Last updated: May 9, 2026

Overview

Opus 4.5 is a topic tracked in our intelligence system with 5 linked articles.

Latest Coverage

Teaching Claude Why

Anthropic reports substantial, scalable gains in Claude alignment from constitution-based training, higher-quality data, and diverse environments. It cites concrete reductions in misalignment rates (e.g., blackmail falling from 65% to 19%, and to 3% with the difficult advice dataset) and notable efficiency gains from 3 million tokens.

May 9, 2026 · 1%

The Cathedral, the Bazaar, and the Winchester Mystery House

The piece reframes AI-assisted coding as a spectrum running from the Cathedral and Bazaar models to a new Winchester Mystery House model. It argues that cheap, idiosyncratic code changes the feedback loop and tooling needs, and backs this with concrete data on commits, PRs, and project examples.

Apr 4, 2026 · 1%

OTelBench: AI struggles with simple SRE tasks (Opus 4.5 scores only 29%)

OTelBench results claim AI struggles with simple SRE tasks, with Opus 4.5 scoring 29%.

Jan 29, 2026 · 1%

Measuring AI Ability to Complete Long Tasks: Opus 4.5 has 50% horizon of 4h49M

Opus 4.5 is reported to have a 50% horizon of 4h49M for completing long tasks, a concrete benchmark datum with unclear methodology.

Dec 21, 2025 · 1%

Auto-grading decade-old Hacker News discussions with hindsight

A practical experiment using LLMs to retrospectively grade 2015 Hacker News discussions, including cost, tooling, and governance implications.

Dec 10, 2025 · 1%

Has the cost of building software dropped 90%?

AI agentic coding could dramatically cut software development costs and timelines, potentially upending the industry by 2026, with examples like a drop from ~$50k to ~$5k for building apps and a 300+ test suite generated in hours.

Dec 9, 2025 · 1%
