Furong Huang of the University of Maryland launches SoundnessBench to evaluate whether AI research agents can judge scientific soundness · Digg