Inception highlights debate from this week's MLSys conference on whether large language models will rely on autoregressive methods or shift to diffusion-based approaches over the next decade
Kuleshov notes diffusion could replace sequential inference after parallel pre-training
The transformer made training parallel and unlocked scaling laws for pre-training. Inference is still sequential. Diffusion is the evolution for inference.
Will the next decade of LLMs run on autoregression, or on diffusion? One of the top questions we got at MLSys this week. Part 6, the final part of our founder story series with @timt at @MenloVentures. Featuring @StefanoErmon, @adityagrover_, @volokuleshov
5:32 PM · May 22, 2026 · 2.6K Views
6:58 PM · May 22, 2026 · 847 Views