12h ago

Nvidia releases Sana-WM open-source world model

0

Nvidia released SANA-WM, a 2.6 billion parameter open-source world model for generating controllable video and simulated environments. The model produces up to 60-second 720p videos from a single image, text prompt, and 6-DoF camera controls while maintaining physics consistency. It runs locally on consumer GPUs such as the RTX 5090, generating clips in around 34 seconds after training on public datasets.

Original post

I don’t understand how this can be 2.6B params

3:51 PM · May 16, 2026 View on X

Great Nvidia release but maybe it should be time to admit "world model" is more about intent than architecture. This is generative video with controls, not related to JEPA.

7:24 AM · May 17, 2026 · 1.9K Views

Aside from the fact they cannot easily manipulate visuals, no reason not to extend this to language AR models/agents

7:40 AM · May 17, 2026 · 610 Views
Nvidia releases Sana-WM open-source world model · Digg