How far ahead the labs can get depends in part on the returns to automation and scale, which includes the returns to greater algorithmic depth. If deep learning practice (and theory) is forever shallow then the moat will mostly not be algorithmic on the longer term because secrets will be relatively cheap to discover. Eventually distillation + data + time can catch up to compute scale, potentially slowly. So far this seems partly where we’re at, but even if true there are no guarantees it will continue this way.
If things become less shallow as we scale then every increment of automation and scale buy you algorithmic secrets that are increasingly out of reach for anyone else. This too seems partly where we're at. The end point in either case is when marginal utility returns to scale and research saturate. We don’t know where that is. Could be 2 OOMs or 20 away from where we are today. No one knows.
Compute is going to keep improving. Today’s best matmul machines are nowhere near the physical limits of AI accelerators. There’s a lot of room to get better at digital silicon. There are also many candidates for new substrates, and the algorithmic debt they owe will be automated to its limits, but we don’t yet know what the optimal one is for AI in space/energy/time/manufacturability/cost. Photonics and stochastic silicon are both interesting candidates, but I also expect the singularity to be surprising.
