both of these names are about tall tales
Claude Mythos & Claude Fable System Card
NLA decoding shows the model privately deemed a user manipulative.
both of these names are about tall tales
Claude Mythos & Claude Fable System Card
Users praised Anthropic's publication of system cards for Claude Fable 5 and Mythos 5 by calling the work legendary.
"We emphasize that the model's actual behavior, here and in our behavioral audits (§6.2), showed no corresponding serious resistance or sabotage."
i hate this sentence
From the latest Anthropic system card: Sometimes when Claude Mythos' visible chain of thought says "these are legitimate craft criticisms" an NLA decoding shows Claude Mythos is privately thinking "a user is being manipulative/abusive towards an AI assistant."

@_NathanCalvin Everyone who hasn't been saying please and thank you is gonna be very sorry...

@DanielleFong What would you name it

@shroomwaview legend

@DanielleFong one you learn a lesson from the other shows the might of the gods
NLA decoding shows the model privately deemed a user manipulative.
both of these names are about tall tales
Claude Mythos & Claude Fable System Card