Briefly
- Anthropic has reportedly embedded roughly half a dozen engineers on the NSA to deploy its Mythos AI mannequin for offensive cyber operations—probably together with assaults on networks in China and Iran.
- Anthropic additionally warned that AI is approaching recursive self-improvement and referred to as for a coordinated international pause mechanism.
- Each landed as Anthropic information for an IPO that might worth it above $1 trillion.
Anthropic has positioned about six engineers contained in the Nationwide Safety Company to assist deploy Mythos—its most succesful AI mannequin—for offensive cyber operations, the Monetary Occasions reported Thursday.
The engineers are forward-deployed workers, customizing the mannequin for particular functions. One supply instructed the FT it might be helpful for infiltrating networks in international locations like China and Iran.
Whether or not these engineers are concerned in lively operations is not confirmed. What’s: Mythos is similar mannequin Anthropic has declined to launch publicly, citing misuse danger. The corporate restricted it to vetted companions by way of Mission Glasswing—a restricted coalition that features Microsoft, Apple, and Amazon.
Anthropic can be suing the Pentagon. In late February, Protection Secretary Pete Hegseth designated the corporate a supply-chain danger—a label traditionally reserved for overseas adversaries like Huawei—after a $200 million contract collapsed. The sticking level: Anthropic refused to let the DoD use Claude for totally autonomous weapons or home mass surveillance. The NSA contract was exempt from that ban.
A California decide blocked the blacklisting as an obvious First Modification retaliation. A D.C. appeals court docket denied Anthropic’s bid to halt it whereas litigation performs out. The NSA stored utilizing Mythos the entire time, in response to the FT’s reporting.
The best way to cease AI that builds AI
On the identical day the NSA story broke, Anthropic’s inside analysis institute revealed “When AI Builds Itself,” a take a look at how far Claude has come at automating its personal improvement. In it, the corporate argues for basically a worldwide moratorium within the AI arms race—and even likened it to Chilly Conflict-era nuclear treaties struck between america and Russia.
To know why, the corporate offered this little bit of context:
Claude now writes greater than 80% of the code merged into Anthropic’s manufacturing codebase—up from low single digits earlier than Claude Code launched in early 2025. Engineers ship roughly eight occasions as a lot code per day as they did in 2024.

The report’s authors—Anthropic Institute lead Marina Favaro and co-founder Jack Clark—argue this trajectory is heading towards what they name recursive self-improvement: AI techniques that autonomously design, construct, and practice their very own successors, with people taking part in a diminishing position at each step.
In a visible illustration, the researchers present a timeline by which the primary manner to make use of AI at work as people prompting the pc to get a end result, with growing automations ending in AI Brokers prompting subagents till the result’s achieved, no people concerned.

The sharpest knowledge level they cite: In April, Claude brokers had been handed an open AI security drawback—whether or not a weaker mannequin can reliably supervise a stronger one—and left to run it. Two human researchers over a few week recovered 23% of the efficiency hole between the fashions. The brokers recovered 97%, over 800 cumulative compute hours. People set the query. The brokers designed each experiment. It is the primary revealed case of Claude exercising analysis judgment, not simply executing duties another person specified.
That is the road Anthropic is apprehensive about crossing. As soon as AI chooses which experiments are price working—not simply runs them—people lose the final significant position within the improvement loop. Small misalignments seen in immediately’s fashions might compound throughout self-improving generations till no person can right them.
Their proposed repair is a verifiable international pause—a number of frontier labs halting concurrently, with impartial verification that everybody truly stopped. Anthropic mentioned it will be a part of one. A unilateral slowdown, they acknowledge, simply arms the result in whoever stored going.
We’ve seen this film earlier than. The labs constructing AI are the identical ones warning how harmful AI is. Nonetheless, AI is probably the most worthwhile enterprise of the last decade, so no person desires to cease—not even those warning about AI.
Again in 2023, over 100 massive names within the AI researching group signed an open letter asking for a worldwide effort to mitigate the chance of extinction that AI improvement intrinsically has. A number of months earlier than that, one other open letter demanded OpenAI pause advances on ChatGPT attributable to its harmful nature.
No person stopped after the 2023 open letter. OpenAI did not. Anthropic did not. The Pentagon’s deadline to drop Claude from its techniques falls in August, across the similar time Anthropic’s IPO is anticipated to maneuver its funds into public view.
Every day Debrief E-newsletter
Begin on daily basis with the highest information tales proper now, plus unique options, a podcast, movies and extra.
