One of the things I've observed time and time again, which makes me incredibly angry, is this SLAVE mentality reasoning: "If >big highly funded group< cant do it, what makes you think you can?"
I can. I will simply do it. Yes, I know better than them. Why do you give up?
you going to go in and make the sim run significantly faster than the combined efforts of Deepmind and NVIDIA? cause that's the bottleneck by far, the RL loop barely matters until you make a real dent there.
other than that it's about coming up with training schemes and algorithms that just learn more efficiently from rollouts, so you just need less samples (since you're sample-bound)














