Alistair Letcher mathematically proves model-free reinforcement learning agents build internal world models when trained on diverse goals · Digg