LisanBench creator @scaling01 argues GPT-5.5-xhigh lacks qualitative judgment and requires highly precise instructions · Digg