Eric Jang argues AlphaGo's Monte Carlo Tree Search bypasses the credit assignment problems plaguing naive LLM reinforcement learning · Digg