Research shows predicting internal latent representations yields exponentially better sample efficiency than next-token prediction on structured benchmarks · Digg