OPUS improves LLM pre-training efficiency by aligning iteration-by-iteration data selection with the optimizer's geometry · Digg