Anthropic: "Claude Mythos 5 does not seem close to being able to substitute for our Research Scientists and Research Engineers"
Anthropic document says Claude Mythos 5 cannot replace human scientists or achieve 2x AI research acceleration
Story Overview
Anthropic's internal document on Claude Mythos 5 highlights that the model still trails senior human researchers and engineers on core tasks, even after gains over Opus 4.7 and 4.8 in targeted benchmarks for cybersecurity, biology, and long-horizon work.
Limits on automated research speed
The assessment finds no sustained 2x acceleration in AI progress and echoes the April 2026 system card conclusion that Mythos 5 has not crossed the automated AI-R&D threshold.
Access split with a safer sibling model
Mythos 5 remains invitation-only via Project Glasswing while the related Claude Fable 5 ships publicly with stricter safety constraints for general reasoning and engineering use.
Some users praised Anthropic's report as a realistic acknowledgment that Claude Mythos 5 cannot replace research scientists, since frontier work remains messy and creative, while others responded with sarcasm about what the finding implies for timelines and hiring.
Most Activity
"Mythos 5 is likely unable to fully and reliably automate R&D for frontier projects spanning multiple weeks"
@scaling01 yeah and better pull up the ladder too if that R&D problem that's failing is academic AI research

@scaling01 timelines lengthened

This is actually the important calibration.
Mythos can be wildly useful and still not substitute for strong researchers.
Those are different claims.
Long-horizon execution ≠ scientific judgment. Parallel agents ≠ research taste. More tokens ≠ epistemic authority. A 9-hour artifact ≠ a verified discovery.
The danger is not that the model is useless.
The danger is users treating impressive autonomy as proof of truth.
Capability is GAS until it survives reality testing.
cc @ericweinstein who is championing the public outcry over federal government cuts in scientific research. We need MORE scientists to validate AI. It is NOT SMARTER THAN US.

@scaling01 It is the Agent1 from Openbrain. Early days of singularity.

@scaling01 because they nerfed it to do that???

@scaling01 Big difference between replacing human engineers entirely and getting more output from half the headcount. The latter still causes a social crisis and is already plausible

@scaling01 "AntHroPic sTopPed hIrInG L5 aNd BeLoW"

@scaling01 just not yet, maybe on mythos 10

It can't replace the top 0.1% researchers picked after 10 interview rounds at the most resource-intensive AI lab on Earth… sure. But that’s not exactly comforting. Should we just relax while a machine went from “severe cognitive deficits” in 2018 to “yeah it’s impressive but it can’t yet replace the ~5,000 people out of 8 billion who dedicated their lives to outperforming everyone else” in 2026? Are we on the same planet where the average person’s closest friend peaks mentally while doomscrolling TikTok?

@scaling01 These are the highest IQ engineer /researcher geniuses you can possibly find btw that they have compared it with.

@scaling01 So it's possibly capable to fully and reliably automate R&D for frontier projects spanning *a single week*

@scaling01 Unable to do tasks spanning multiple weeks🤔 Maybe it is just a memory problem?

@nullvaluetensor @scaling01 It's definitely not nerfed at Anthropic

@scaling01 Loops...:(

@scaling01 Of course it cannot replace research scientists yet. Frontier work is still messy and creative.

@scaling01 mythos 5 sounds like a metal album