Prime Intellect's Elie Bakouch says GLM5.2 reintroduces critic models, prompting debate on PPO value estimation trade-offs · Digg