The core exploit I'm attempting is that all of the tricks that people use for sim2real basically shake out to what happens when you write a low fi simulator. Mujoco playground's G1 baseline has obs noise, for example. So why spend all this time computing accuracy?
I'm trying to accelerate RL for robotics. I think that everyone is doing it wrong. My goal is to use my influence to help change the trajectory that things are going by creating end to end reproducible loops that help unclog people's brains from the memes

