/AI6h ago

NVIDIA releases Nemotron-3-Ultra-550B-A55B, an open-weights reasoning model running at 420.2 tokens per second on Blackbox AI

API pricing is $1.08 per million output tokens.

--0--
Quote posts
Reposts
Original postBryan Catanzaro#434
BLACKBOX AI@blackboxai

420.2 tok/s on a 550B model. ⚡️

Nemotron-3-Ultra-550B-A55B reaches 420.2 tok/s powered by BLACKBOX AI Inference Engine.

Blackbox now delivers the fastest inference in the industry, outperforming every other provider, including on smaller-parameter models.

Check our blog in the comments.

7:45 AM · Jun 4, 2026 · 11.5K Views
Sentiment
Sentiment building, check back later.
Cluster Engagement
-
Views
-
Comments
-
Reposts
-
Bookmarks
Expand data
Posts from X
Most Activity
Most ActivityTimeline
No ranked X posts are available for this story yet.