i wonder how much of "model improvement perception" (and model hype) is just human psychology being reward hacked
for instance i kinda miss fable (i used it for like 1 day) and i find some opus outputs dumb, and i genuinely have no idea if it's a real difference or just me being reward hacked
















