12h ago

Nvidia releases SANA-WM open-source world model

Nvidia released SANA-WM, a 2.6-billion-parameter open-source world model for generating controllable video and simulated environments. The model produces 720p videos of up to 60 seconds from a single image, a text prompt, and 6-DoF camera controls while maintaining physics consistency. It was trained on public datasets and runs locally on consumer GPUs such as the RTX 5090, generating clips in around 34 seconds.

Original post

Sergey Karayev #1140 @SERGEYKARAYEV

I don’t understand how this can be 2.6B params

3:51 PM · May 16, 2026

Cluster engagement

23 snapshots

QUOTE POST

#867 Alexander Doria @DORIALEXANDER

Great Nvidia release but maybe it should be time to admit "world model" is more about intent than architecture. This is generative video with controls, not related to JEPA.

7:24 AM · May 17, 2026 · 1.9K Views

QUOTE POST

#867 Alexander Doria @DORIALEXANDER

Aside from the fact they cannot easily manipulate visuals, no reason not to extend this to language AR models/agents

7:40 AM · May 17, 2026 · 610 Views

QUOTE POST

#1140 Sergey Karayev @SERGEYKARAYEV

I don’t understand how this can be 2.6B params

10:51 PM · May 16, 2026 · 114.7K Views