/AI2h ago

NVIDIA releases Nemotron 3 Ultra, a 550-billion parameter open-weight hybrid Mamba2-Transformer MoE model

It features 55 billion active parameters and multi-token prediction.

--0--
Original post
Bryan Catanzaro@ctnzr#454inAI

During the past 6 months, Nemotron has grown from 24 to 48 on the AAI, and we're just getting started.

NVIDIA Nemotron 3 Ultra is now live!

Frontier accuracy, 5X greater speed, 30% lower cost.

Deploy however you need - on-premise, on the cloud, or at the edge.

Model is live on HuggingFace under the OpenMDW 1.1 license.

https://www.youtube.com/watch?v=D8LIIvQVGS4

5:42 AM · Jun 4, 2026 · 1.5K Views
Sentiment
Sentiment building, check back later.
Cluster Engagement
-
Views
-
Comments
-
Reposts
-
Bookmarks
Expand data
Posts from X
Most Activity
Most ActivityTimeline
VIEWS73.4KBOOKMARKS272LIKES1KRETWEETS144REPLIES65
NVIDIA AI@NVIDIAAI

Today we're shipping Nemotron 3 Ultra.

A 550B MoE frontier-intelligence open model built for long-running agents.

It delivers 5x faster inference and lowers the cost of complex agentic tasks by up to 30% versus other open frontier models.

2hViews 73.4KLikes 1KBookmarks 272
NVIDIA releases Nemotron 3 Ultra, a 550-billion parameter open-weight hybrid Mamba2-Transformer MoE model · Digg