This is not just an efficiency gain; it represents a structural shift in the development pipeline. The consequence is that Anthropic engineers are now shipping eight times more code per quarter compared to the 2021–2025 baseline . The bottleneck in software creation is shifting from writing and testing to higher-level goal-setting, architecture, and judgment.
The report provides a timeline of breakneck acceleration across several standard measures of engineering and research competence :
Perhaps the most impactful metric for predicting recursive self-improvement is the duration of autonomous tasks. Research from METR tracks how long an AI can work independently at a 50% success rate. This task horizon has expanded from roughly 30 seconds in 2022 to 12 hours with Claude Opus 4.6 by April 2026, a 1,440-fold increase . Claude Mythos Preview can work for at least 16 hours, which is noted as being near the upper limit of what METR can currently measure
. The doubling rate for this horizon has accelerated from every seven months to every four months
.
The quantitative data on code and benchmarks is coupled with internal polling on human productivity. Analysis of 200,000 internal Claude transcripts and 53 in-depth interviews found that 27% of AI-assisted tasks were jobs employees simply would not have attempted without AI, because the time cost previously made them impractical . This is not automation of existing work but an expansion of what is possible to attempt at all. In a separate November 2025 internal study, employees reported using Claude in 60% of their work and estimated a 50% productivity boost, up from 20% the prior year
.
Anthropic's position is explicit. The company states, "We are not there yet, and recursive self-improvement is not inevitable. But it could come sooner than most institutions are prepared for" . It argues that a global ability to pause or slow AI development would "likely be a good thing" and is directly urging other labs to consider it
.
The same week, OpenAI's actions painted a contrasting picture. On June 3, OpenAI published a public policy agenda calling for a federal frontier AI safety framework with mandatory model evaluations and whistleblower protections, but also with a critical clause: the preemption of state-level safety laws . It explicitly asks the federal AI safety institute CAISI to prioritize monitoring progress toward recursive self-improvement
. Simultaneously, OpenAI is staffing up for this exact risk, creating a "Researcher, Recursive Self-Improvement Preparedness" role within its safety team with a posted compensation of $295,000 to $445,000
. The job is framed as a loss-of-control containment problem, a "tasteful and strategic" effort to mitigate risks that "might exist in the future, but might not exist now"
.
Both labs see the same wave approaching, but Anthropic is urging the fleet to slow down, while OpenAI is hiring lifeguards and arguing against any single state issuing a swimming ban.
Anthropic co-founder Jack Clark has separately estimated a 60% probability that this "loop" closes by the end of 2028 . The internal data in the June 4th post provides the factual underpinning for why that estimate is not a distant hypothetical but a projection from a curve that is already visibly bending upward.
Comments
0 comments