DeepSeek V4-Pro Delivers 3x Cheaper Inference Than Prior Models

Original post

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

What could that mean? Let's say 10 million trajectories, each 100K tokens long, or 625K GRPO groups of 16. They used batch = 512 for R1, so that's enough for 1,220 steps, or 10K steps in 8 days. This is all very conservative.
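The arithmetic in this post can be checked with a short sketch. It uses only the figures quoted above and makes two assumptions that the post does not spell out: that the batch size of 512 counts GRPO groups per optimizer step, and that the 10 million trajectories represent one day of rollouts (10M x 100K tokens is 1T tokens, which matches the daily figure in the parent post). The variable names are illustrative.

```python
# Back-of-envelope RL rollout budget, using the figures quoted in the post.
TRAJECTORIES = 10_000_000         # rollouts, assumed here to be one day's worth
TOKENS_PER_TRAJECTORY = 100_000   # tokens per rollout
GROUP_SIZE = 16                   # GRPO group size
BATCH_GROUPS = 512                # R1 batch size, read here as groups per step (assumption)
TARGET_STEPS = 10_000             # the "10K" the post mentions

total_tokens = TRAJECTORIES * TOKENS_PER_TRAJECTORY   # token budget for all rollouts
groups = TRAJECTORIES // GROUP_SIZE                   # number of GRPO groups
steps = groups // BATCH_GROUPS                        # full optimizer steps from that budget
days_for_target = TARGET_STEPS / steps                # days to reach the target step count

print(f"total tokens:   {total_tokens:,}")
print(f"GRPO groups:    {groups:,}")
print(f"steps:          {steps:,}")
print(f"days for {TARGET_STEPS:,} steps: {days_for_target:.1f}")
```

Under these assumptions the numbers line up with the post: 625K groups, 1,220 full steps, and a little over 8 days for 10K steps.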

@stochasticchasm @willccbb what's the actual RL economics now?

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

This also gives me plenty of hope for V4.1 and beyond. Consider: they don't need max speed inference for async RL rollouts. V4-Flash does… like 14K tokens/GPU at 100 tps. If that's 950DT, then one SuperPOD = 5T tokens/day. Or at least 1T at 20% utilization. data machine go brrr

5:17 AM · Jun 27, 2026 · 202 Views

Posts from X

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

Correction: that's at 10% utilization. (14K*8192)*3600*24 ≈ 9.9T
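The correction can be reproduced as a quick calculation. This is a sketch under the post's own inputs: 14K tokens per second per GPU for V4-Flash, and 8192 GPUs per SuperPOD, a count that is implied by the formula in the post rather than stated outright. The constant names are illustrative.

```python
# Peak daily token throughput for one SuperPOD, per the correction post.
TOKENS_PER_GPU_PER_SEC = 14_000   # "14K tokens/GPU" from the original post
GPUS_PER_SUPERPOD = 8192          # implied by the correction's formula (assumption)
SECONDS_PER_DAY = 3600 * 24

peak_tokens_per_day = TOKENS_PER_GPU_PER_SEC * GPUS_PER_SUPERPOD * SECONDS_PER_DAY

# Utilization needed to produce the "at least 1T" tokens/day from the earlier post.
target_tokens_per_day = 10**12
utilization = target_tokens_per_day / peak_tokens_per_day

print(f"peak tokens/day: {peak_tokens_per_day / 1e12:.1f}T")
print(f"utilization for 1T/day: {utilization:.0%}")
```

With these inputs the peak comes to roughly 9.9T tokens per day, so 1T per day corresponds to about 10% utilization, as the correction says.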
