Reality is, "simplicity is king" is such a normie thing to say. Frontier systems are rarely ever "simple".
@OfirPress This is mostly a matter of poor tooling and bad hyperparameter setups. A complex model can have as much as 10x the effective size if done without confounders. The era of dumb scaling is more over by the day imo












