@eliebakouch @kellerjordan0 @karpathy Thanks @eliebakouch! Really impressed with what you do at @PrimeIntellect as well 🚀
I agree, finding the right open-ended & safe environments is going to be key to develop this technology further.
very cool results!
i really believe speedruns by @kellerjordan0 @karpathy and others are a great testbed for RSI like tasks, both for evaluating and improving how models do research
there are still limitations in how current speedruns are designed, and work needed on the system side to make agents efficient. seeing what models can do in math/code, i believe one missing piece to achieve the same for ai research is giving them the right environment