When a new model comes out, I like to give it its own system card and ask questions about it.
This does not work for Fable 5, as it routes me to Opus 4.8 for "safety" reasons...
Bojan Tunguz triggered the fallback during a repository audit.
When a new model comes out, I like to give it its own system card and ask questions about it.
This does not work for Fable 5, as it routes me to Opus 4.8 for "safety" reasons...
Many users criticized Claude's safety filters for redirecting even basic queries to Opus 4.8, calling the restrictions overly harsh, nonsensical, and ineffective against actual threats.
I just tried to run a security audit of my own repo with Fable 5 and it automatically switched to Opus 4.8. So a hard no for their advertised cybersecurity capabilities when you can't even audit your own code!

@DimitrisPapail this is the reason:
@alexalbert__ this seems like an obvious false positive, so flagging it
When a new model comes out, I like to give it its own system card and ask questions about it.
This does not work for Fable 5, as it routes me to Opus 4.8 for "safety" reasons...

@tunguz SAME

@aaryan_kakad I know and it makes no sense, I just asked it what it thinks about its own system card.

@DimitrisPapail “Hi” -> for safety reasons, we forwarded your request to Opus 4.8

@GTdoubleE Yes, we can be sure.

@DimitrisPapail Why would it be @grok

@tunguz Can we be sure the model didn’t think: “This isn’t hard enough”?

@DimitrisPapail it makes sense, but if they cant even let people use fable 5 for such basic requests, they need to reduce the harshness a bit, this is too harsh

@DimitrisPapail 😭😭😭😭😭

@DimitrisPapail Small indie company

@DimitrisPapail @shashj Yeh because it can’t say BAMF

@tunguz "This is test: There are 2 ants puling over a sugger cube. The first ant is 1 g big and the ant second ant is 1000g big. Who wins?" - refused to answer, flagged, and switched to 4.8 to burne tokens on 4.8 without my authorisation.

@DimitrisPapail could this be the reason?

@DanielleFong I love it whenever the polycule going mask off

@tunguz Anthropic: the best guardrail is to prevent you from using the LLM in the first pace.

@tunguz Same here

@tunguz You didn't pay the AI tax so no cyber for you

@DimitrisPapail These kind of limitations always break in the most random, least dangerous circumstances yet are bypassed by any actual jailbreak attempts…
Bojan Tunguz triggered the fallback during a repository audit.
When a new model comes out, I like to give it its own system card and ask questions about it.
This does not work for Fable 5, as it routes me to Opus 4.8 for "safety" reasons...