It's almost certainly because they did some kind of A/B testing.
My guess is they defined frontier model development so broadly that transparently refusing all of it would essentially make the model stop working.
I could also see them thinking that a transparent refusal could be worked around through sheer volume of requests.
The silent-failure strategy is a bit weird. I can speculate that it is meant to prevent jailbreaks, but not doing the same for other safety risks makes this a rare case of an inconsistent strategy.


