Diffusion Transformers Use Morphogen-Like Gradients To Solve Spatial Prompts

Binxu Wang 🐱@WangBinxu

1/ When diffusion generates images from text, before an image has objects, how does each noisy token know what it should become?

In our new work, we found that Diffusion Transformers solve spatial-relation prompts using a circuit motif reminiscent of developmental biology: morphogen-like spatial gradients.

At the start of sampling, image tokens are mostly uninformed noise — like an undifferentiated sheet in an embryo. Relation heads then write smooth spatial gradients onto the image canvas, guiding where objects should emerge.
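To make the analogy concrete, here is a toy sketch of a smooth positional gradient telling otherwise uninformed tokens which object to become. It is an illustration only, not the circuit described in the paper: the names `relation_gradient` and `assign_identities`, the linear gradient shape, the relation labels, and the zero threshold are all assumptions made for this example.

```python
import random


def relation_gradient(height, width, relation):
    """Hypothetical helper: a grid of values running smoothly from +1 to -1.

    For "left_of" the value falls from left to right; for "above" it falls
    from top to bottom. A real model would learn its own gradient shape.
    """
    if height < 2 or width < 2:
        raise ValueError("grid must be at least 2 x 2")
    if relation not in ("left_of", "above"):
        raise ValueError(f"unsupported relation: {relation}")

    grid = []
    for row in range(height):
        values = []
        for col in range(width):
            # Normalized position in [0, 1] along the axis the relation uses.
            t = col / (width - 1) if relation == "left_of" else row / (height - 1)
            values.append(1.0 - 2.0 * t)
        grid.append(values)
    return grid


def assign_identities(height, width, relation, noise_scale=0.1, seed=0):
    """Label each token "A" or "B" from the gradient plus Gaussian noise.

    Tokens start out as noise; the gradient biases them so that object A
    emerges where the signal is positive and object B elsewhere.
    """
    rng = random.Random(seed)
    gradient = relation_gradient(height, width, relation)
    labels = []
    for row in gradient:
        labels.append(
            ["A" if g + rng.gauss(0.0, noise_scale) > 0 else "B" for g in row]
        )
    return labels


if __name__ == "__main__":
    # Toy prompt: "A to the left of B" on a 4 x 8 token grid.
    for row in assign_identities(4, 8, "left_of"):
        print("".join(row))
```

In this toy version the threshold plays the role that concentration thresholds play in morphogen models: one smooth signal is enough to split the canvas into regions with different fates.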

Accepted as a @CVPR 2026 Highlight🌟: http://animadversio.github.io/DiT-Relation-Circuits

Beautiful collaboration with my friends and colleagues @fjxdaisy & Xu Pan! A 🧵

8:25 PM · Jun 2, 2026 · 7.9K Views
