/AI7h ago

NVIDIA Reduces SOCAMM Capacity For Rubin To Ease DRAM Demand

3656344166133.5K

Original post

Zephyr@zephyr_z9#1471inAI

The fall is so stupid U don't really need 1.5TB per CPU for training, chatbot interactions, and light agentic workloads Provisioning that much DRAM for them will just lead to underutilization and waste of resources (500GB-750GB per CPU is enough) My biggest fear was Nvidia being unable to sell finished Rubin NVL72 racks becuz of the DRAM shortage Now they can handle demand in a much better way

2:52 AM · Jun 6, 2026 · 109.3K Views

Sentiment

Many users welcomed NVIDIA's SOCAMM capacity cuts and dual DRAM options for Rubin as a smart bullish move easing DRAM demand, while a few called the changes dumb.

Pos

85.7%

Neg

14.3%

9 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

VIEWS24.2KBOOKMARKS29LIKES163RETWEETS11REPLIES7

Zephyr@zephyr_z9

Nvidia is offering dual configs 1.5TB per CPU for racks assigned to agentic workloads 750GB per CPU for the rest of the stuff

Zephyr@zephyr_z9

6h24.2K16329

beroccan@beroccan

@zephyr_z9 Sk hynix prob way more down than this since Korea closed before the blood started. The European ADR closed -20,53% yesterday

6h4976

The Oracle@CifrOracle

@zephyr_z9 For those who need a cheat sheet btw this is VERY BULLISH $MU

5h6614

GalleonCrypto@GalleonCrypto

@cryptoendgamer @zephyr_z9 Okay $7k level before aping

5h261

GalleonCrypto@GalleonCrypto

@zephyr_z9 So resume up next week?

7h1412

T_wack669@t_wack669

@zephyr_z9 Many stocks went down 10%, not just memory stocks. Out of curiosity, how are correlating this specific news to the broader selloff?

5h3341

ひろたん@AI-FX@cA5YtiJ0YD4888

@zephyr_z9 @paurooteri 秋？ってなったけど文脈からThe fall の誤訳と思ったら案の定…w

6h3031

Bubba G@BubbaGde

@zephyr_z9 I think a lot of it has to do with temporary mechanical leverage unwinding rather than any fundamentals

7h2231

GOD DID@izzychamp2024

@zephyr_z9 Panic sell-off amidst all this SpaceX shit we will be back soon

6h1221

让长风使尽@Rangfeng1117

@zephyr_z9 Agentic workloads need massive memory for multi-step reasoning context. Nvidia's split makes sense: prioritize bandwidth for inference, not storage. Shows they're optimizing for LLM memory walls, not GPU count. Smart segmentation.

6h871

Jake Ayes@jake_ayes

@zephyr_z9 I mean it meaningfully changes supply demand forecasting if you don’t need 1.5TB per CPU but I hear you

3h841

░\_/TT\_/░@pers0naluni0n

@zephyr_z9 It is time for On-Prem.

All companies I know don't want to donate their data lake to anthropic / openai / amazon / microsoft

Signing LOI with the cloud service providers fell of a cliff.

I see 100th and 1000th of small installations.

6h134

Paco@Pacoxbt

@zephyr_z9 The utilization point makes sense for current workloads

Wonder how this shapes infrastructure design as models keep scaling

7h128

ZeroMazed@ZeroMazed

@zephyr_z9 damn right they don't need 1.5tb per cpu for basic chat. my tracker caught this supply chain shift early.

7h126

Avangardem@avangardem

@zephyr_z9 So you crashed the market for nothing? :D

5h103

GOD DID@izzychamp2024

@zephyr_z9 Obvious all of this is VERY dumb

6h97

Newtee@Newtlx

@zephyr_z9 Provisioning 1.5tb per cpu for chatbots is overkill

underutilization wastes resources and inflates costs efficiency gains from right sizing hardware are significant for scaling ai workloads

7h93

Jamie Follese@JamieFollese

@zephyr_z9 Written October 11th. https://open.substack.com/pub/jamiefollese/p/o3o2o1?r=22e59f&utm_medium=ios nobody cares until it’s over as usual

3h81

Jamie Follese@JamieFollese

@zephyr_z9 Yep, been a matter of time for quite some time. Just wasn’t a popular position to take until about a week ago. https://open.substack.com/pub/jamiefollese/p/the-era-of-anti-scaling-is-here?r=22e59f&utm_medium=ios

3h76

Azael@theazaelov

@zephyr_z9 750 per CPU for general workloads is realistic. the dual config move makes the waste optional which is actually smart

4h34