3h agoUniversity of Maryland's Furong Huang releases SoundnessBench to evaluate whether LLMs can judge the soundness of scientific research proposalsThe dataset is publicly available on Hugging Face