New post on Red today: Our team @AnthropicAI found that Mythos Preview is meaningfully better at developing N-days. It took us a couple thousand $ and a few hours to convert patches into exploits.
We publish research like this because we think it's important the world knows what models are/will be capable of.
In a year, Mythos will probably look trivial. We want to help the world to start preparing.
I'm excited to share a lot more blue team / defensive work. I feel like people are aware of the issue now, and the team's task is now to "solve it all" -- we have some exciting / interesting / creative defensive research lined up.

















