Anthropic says the nerf only affects 0.03% of requests. That 0.03% is the people who change the world.
@theemozilla and @Karan4d, co-founders of Nous Research:
"The priority is to hide the fact that the classification is happening at all... how are people going to know when the model is being steered?"
"This whole... it's only gonna be triggered by .03% of people. It'll barely ever happen."
"How many people that are gonna change the world are there? .1% of the whole of everyone is a lot of people. Those are a lot of people."
"You're basically saying there are critical outlier people that move mountains... they're the only ones we're blocking. They're the only ones whose results we're fudging."