Marin researchers validate scaling laws by successfully predicting a 129-billion-parameter MoE model's loss before training · Digg