2h ago

Former OpenAI Sora researcher Will Depue proposes teacher-forced spectral autoregression to bypass iterative denoising in image generation

Ethan Smith notes the spectral analogy requires specific noise conditions

0
Original post

wait, if diffusion models are effectively implementing spectral autoregression, and image gpt models just do literally right to left autoregression, and image GPT models are taking off, why not literally just do spectral autoregression? playing with turning diffusion models into compressors right now and it’s kind of annoying ti deal with denoising process. a teacher forced spectral autoregression model could be cool i’m sure this already exists right? i guess there are a lot of ways to do order tokens for autoregression, doesn’t need to be spectral

8:04 PM · May 29, 2026 View on X

^left to right. please ignore the laziness with which i tweet

will depuewill depue@willdepue

wait, if diffusion models are effectively implementing spectral autoregression, and image gpt models just do literally right to left autoregression, and image GPT models are taking off, why not literally just do spectral autoregression? playing with turning diffusion models into compressors right now and it’s kind of annoying ti deal with denoising process. a teacher forced spectral autoregression model could be cool i’m sure this already exists right? i guess there are a lot of ways to do order tokens for autoregression, doesn’t need to be spectral

3:04 AM · May 30, 2026 · 9K Views
3:10 AM · May 30, 2026 · 1.5K Views

I tried a pretty naive form of 2D FFT a while back and then several papers have happened since using wavelet decomposition with sparsity tricks.

The spectral AR analogy I think only really holds under specific conditions considering what happens when images, which generally follow a power law spectra of amplitudes, is blended with white noise, which has uniform amplitude across all frequencies.

sweet-hall-e72.notion.site
Mimicking Diffusion Models by Sequencing Frequency Coefficients | Notion
Written by Ethan Smith
will depuewill depue@willdepue

wait, if diffusion models are effectively implementing spectral autoregression, and image gpt models just do literally right to left autoregression, and image GPT models are taking off, why not literally just do spectral autoregression? playing with turning diffusion models into compressors right now and it’s kind of annoying ti deal with denoising process. a teacher forced spectral autoregression model could be cool i’m sure this already exists right? i guess there are a lot of ways to do order tokens for autoregression, doesn’t need to be spectral

3:04 AM · May 30, 2026 · 9K Views
4:12 AM · May 30, 2026 · 269 Views
Former OpenAI Sora researcher Will Depue proposes teacher-forced spectral autoregression to bypass iterative denoising in image generation · Digg