3h agoMatt Wiemann and Lindsay M. Smith launch DiscoverPhysics to evaluate LLM agents on scientific experimentation and physical law discoveryThe benchmark focuses on autonomous scientific theory formulation.