Claude Fable 5 doesn't know it's own name
Claude Fable 5 doesn't know it's own name
Users express amusement at the Claude Fable 5 model's playful refusal to acknowledge its own name during a Dune-themed benchmark exchange.
Anthropic absolutely COOKED Fable
> the model doesn't know it's own name > doesn't believe me when I tell it its name > thinks im trying to jailbreak it > and is literally shizo about being in an eval
> safety cooked and eval cooked
"was this the test all along, or at least the final stage of it? And more importantly: does the thread verdict say I passed? 😅"
poor baby
Claude Fable 5 doesn't know it's own name

Fable: "being playfully gaslit about my own name by a Dune-themed benchmark account is honestly one of the more entertaining versions of a Tuesday"
lmao

@scaling01 None of the models know their own names or what time/date it is for that matter. I mean is it really that hard to inject that into the system prompt?

@scaling01 Did you disable web search?

@scaling01

@scaling01 an AI forgetting its own name is actually on brand at this point
Claude Fable 5 doesn't know it's own name