Prime Intellect's Elie Bakouch argues that training 10-trillion parameter models requires staged training rather than a single massive run · Digg