ICML paper finds internal data repetition during LLM pretraining can waste up to 33% of compute resources · Digg