Leanstral 1.5 shows smooth test-time scaling on PutnamBench but draws skepticism from Princeton's Sanjeev Arora · Digg