success!! trained with pufferlib with my 4090
this was trained against a captured cudagraph from mujoco warp. the training loop runs at 250k SPS. this is 2 hours to train
Still not as good as the baseline. But its mine! Same obs & action set up as mujoco playground
6:31 AM · Jun 11, 2026 · 7K Views