Claude Fable has made significant progress on the current worlds in our DiscoverPhysics benchmark, led by @Space_Boy_Matt and @LindsayMSmith3, solving difficult worlds with latent structure that other models cannot!
So excited about this project. Despite all the talk about AGI, AI has barely scratched the surface of discovering scientific theories or even giving us new scientific insights. DiscoverPhysics is a benchmark for the future.