@tydsh Please point this at simulators and optimizing them for RL training loops
Early results from Recursive 馃殌馃殌
SotA results from our open-ended knowledge discovery system:
1锔忊儯NanoChat 5min pre-training (0.9372 bpb -> 0.9109 bpb, 2.8% lower Bits-Per-Byte than long-standing community SoTA)
2锔忊儯NanoGPT SpeedRun (79.7s -> 77.5s, 2.8% faster than long-standing community SoTA)
3锔忊儯GPU kernel optimization (Overall 7.8% better than SoTA performance in SOL- ExecBench, hosted by NVIDIA)
To achieve that, our system automatically finds and combines innovations together to find better solutions than current ones carefully designed by expert humans in various domains.
We have open-sourced resulting artifacts found by our system so you can check the output yourself. See a full breakdown and technical writeup:
https://www.recursive.com/articles/first-steps-toward-automated-ai-research