When a new model comes out, I like to give it its own system card and ask questions about it.
This does not work for Fable 5, as it routes me to Opus 4.8 for "safety" reasons...
Filters trigger on most cybersecurity or biology topics.
Users criticize Claude's safety filters, which redirect queries for the new model to Opus 4.8, as ineffective and inconsistently applied: they trigger on harmless requests yet are bypassed by actual jailbreaks.
@alexalbert__ this seems like an obvious false positive, so flagging it
i can already tell i am going to hate this
@DimitrisPapail broooooo
@natolambert can't even ask the model about its own system card...
@eliebakouch oh it's worse
Unbelievable

@DimitrisPapail could this be the reason?

@DimitrisPapail These kinds of limitations always break in the most random, least dangerous circumstances, yet are bypassed by any actual jailbreak attempt…