if a model refuses, it should score as 0 on that task
1:57 PM · Jun 9, 2026 · 6K Views
if a model refuses, it should score as 0 on that task
Positive users back zeroing AI benchmark scores for refusals while negative users mock the evaluations as pointless or absurd.
Also, refusals on MMLU?? What are they even doing over there
if a model refuses, it should score as 0 on that task

@xeophon wtf are these evals.
Someone should just fine tune gemma to just refuse to do anything and route to Best-of-n across all models

@xeophon 100%

@xeophon Might as well report 0 on all new AI benchmarks, saves cost

@jconorgrogan tempting…
if a model refuses, it should score as 0 on that task