3h ago
Analysis argues diffusion models could scale better than autoregressive models as FLOPS cheapen relative to memory bandwidth
Cartwheel's Andrew Carr praised the discussion over arXiv papers.
Sentiment: 100% positive, 0% negative (1 comment with sentiment).
Users praise @andrew_n_carr for freely sharing valuable insights comparing autoregressive versus diffusion models because it helps save compute.