Anthropic Shifts Warning from AI Job Loss to AI Self-Improvement Risk
Anthropic Warns of AI Self-Improvement Over Job Loss

Anthropic spent most of 2026 warning the world that artificial intelligence was coming for white-collar jobs. CEO Dario Amodei delivered this message at Davos, in essays, at the India AI Impact Summit, and on numerous podcasts. He predicted unemployment could hit 10 to 20 percent within five years, with coding being the first sector affected, followed by finance and law. Pension funds and sovereign wealth managers, weighing a reported $900 billion valuation for Anthropic, heard a market the size of the global wage bill. Now the company has a new message that sits awkwardly on top of the old one.

In a paper exceeding 10,000 words titled When AI Builds Itself, Anthropic argues that the bigger danger is not a wave of laid-off coders and junior analysts. Instead, it is the prospect of an AI system capable of designing and training its own successor, with humans pushed to the edges of a process they used to run from start to finish. The company calls this recursive self-improvement. It states that we are not there yet, but then spends thousands of words laying out evidence that the gap is closing fast. It warns that if the gap does close, the work of securing, monitoring, and shaping these systems becomes far more urgent than anyone is currently treating it.

Inside Anthropic, Claude is already writing the company that builds Claude. The paper, co-authored by Marina Favaro and Jack Clark of the Anthropic Institute, leans on internal data the company has never disclosed before. As of May 2026, more than 80 percent of the code merged into Anthropic's own codebase was authored by Claude. Before Claude Code launched in February 2025, that share sat in the low single digits. The average Anthropic engineer now ships eight times as much code per quarter as they did between 2021 and 2025. In a March 2026 internal poll, the median researcher said they were producing roughly four times as much output with the still-internal Mythos Preview model as they would have without any AI at all.

Wide Pickt banner — collaborative shopping lists app for Telegram, phone mockup with grocery list

One example the paper offers is hard to dismiss. In April 2026, Claude shipped over 800 fixes that cut a class of API errors by a factor of one thousand. The engineer overseeing the work estimated a human would have taken four years to finish it. External evaluator METR reports that Mythos Preview can now work on its own for at least 16 hours, sitting at the upper end of what the lab can even measure. The length of tasks AI can complete reliably is doubling every four months, up from seven last year.

The risk Anthropic is now flagging is not unemployment; it is losing the steering wheel. This is where the framing pivots away from Amodei's job-loss circuit. Anthropic argues that if an AI system becomes capable of building its successor with no meaningful human input, alignment stops being a research problem and becomes a survival one. The paper warns that the rare cases of misalignment present in today's models could compound as those models build the next generation, growing more frequent but less understood until humans lose control of them.

The company also hints at how fast things move once an AI is loose at scale. Project Glasswing, Anthropic's program that gives Mythos Preview to a small group of trusted partners, surfaced more than 10,000 high- and critical-severity software vulnerabilities across major systems in its first few weeks. The bottleneck has already shifted from finding holes to patching them fast enough.

Why Anthropic is now floating the idea of a coordinated pause on frontier AI is striking for a company chasing a reported $900 billion valuation. The paper's closing section says Anthropic would support a slowdown or temporary pause on frontier AI development, on one condition: that other top labs do the same, in a way that can be independently verified. The company concedes this is hard. Training runs are easier to hide than missile silos. The incentive to defect is enormous. There is no global body to adjudicate. But Anthropic says the Anthropic Institute will start building the verification systems a credible pause would need, and will convene policymakers, researchers, and other AI companies in the coming months.

Pickt after-article banner — collaborative shopping lists app with family illustration

For now, Claude still waits for Anthropic engineers to tell it what problem to solve. The paper's argument is that the gap is closing, and that the public conversation has spent too much time on whose job goes first, and not enough on who, or what, is in charge when the process starts running itself.