/Tech40d ago

Grigory Bartosh presents Dual-Rate Diffusion from Google DeepMind to accelerate inference in standard and distilled diffusion models by interleaving a heavy context encoder with a lightweight denoiser

The method lowers computational costs during generation for tested models.

92063013817.8K

#609

Original post

Grigory Bartosh@GrigoryBartosh

🚀 Excited to share my @GoogleDeepMind student researcher project: Dual-Rate Diffusion✨

⚡ A simple construction that speeds up both regular diffusion and distilled models by interleaving a heavy context encoder with a light conditional denoiser.

🧵👇

6:27 AM · May 20, 2026 · 16K Views

Sentiment

Users are excited about Dual-Rate Diffusion from Google DeepMind because it straightforwardly improves quality-compute tradeoffs for regular diffusion models while praising the collaborative research team.

Pos

100.0%

Neg

0.0%

2 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

VIEWS144REPLIES1

Grigory Bartosh@GrigoryBartosh

📈 It is very straightforward to implement for regular diffusion, where it already improves the quality-compute tradeoff. More importantly, we show that Dual-Rate can accelerate even few-step distilled models like MMD while preserving sample quality gains over the teacher model🤯

40d144

LIKES2

Grigory Bartosh@GrigoryBartosh

🙏 I was lucky to work with a great team: @djjruhe, @emiel_hoogeboom, @JonathanHeek, Thomas Mensink and @TimSalimans

📜 Check out the paper for more details: https://arxiv.org/abs/2605.18190

40d1202

Grigory Bartosh@GrigoryBartosh

💡 The core idea of Dual-Rate is to give the model a high-dimensional space to easily store computation from previous steps. Instead of one large denoising model we use two: a context encoder evaluated sparsely and a light denoiser conditioned on the encoder’s representations.

40d1131

Grigory Bartosh@GrigoryBartosh

🤔 Regular diffusion models generate samples through iterative refinement. Although sample quality improves over time, the model still has to repeat much of the same work at every step just to understand what it is looking at and to build useful features.

40d121

PaperTrace@usepapertrace

@GrigoryBartosh @GoogleDeepMind Neat idea. Do you share code and a simple rerun script, and how does FID or throughput compare to a strong baseline at the same compute? Curious where the gains land.

40d65