/Tech · 8h ago

Developer Teortaxes and contributor Grad debate separating DeepSeek V4 training and inference workloads across multi-cluster systems

Teortaxes proposes running reinforcement learning on separate small servers

Original post

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞) @teortaxesTex · #450 in Tech

@Grad62304977 Inference is happening on multiple clusters by default. I mean the question of how DeepSeek's training playbook, as described in the V4 paper, can fit with this domestic system/method. RL on a small separate server would be an adequate way to produce a marginal teacher.

Grad @Grad62304977

@teortaxesTex I don't really see the direct need for MOPD here. You can parallelise the inference across clusters, as Cursor and we did before. Training is the main difference, but for RL it's unlikely you would reach a point of needing this. It also seems you can do this without MOPD.

6:58 PM · Jun 6, 2026 · 293 Views

Posts from X

Views: 23 · Likes: 3

Grad @Grad62304977

@teortaxesTex Ya, fair, just that you could also use these small separate servers for inference (the main workload), and you just have to centralise the training workload (technically you don't need to either, but still).
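
The split Grad describes (inference fanned out to small separate servers, training kept in one place) can be illustrated with a toy sketch. Everything here is hypothetical and not taken from the V4 paper or either poster's setup: `generate_rollouts`, `central_update`, and `rl_step` are made-up names, the "clusters" are just threads, and the rewards are random numbers standing in for scored samples.

```python
import random
from concurrent.futures import ThreadPoolExecutor


def generate_rollouts(cluster_id, policy_version, n, seed):
    """Hypothetical stand-in for inference on one small cluster."""
    rng = random.Random(seed)
    return [
        {"cluster": cluster_id, "policy_version": policy_version, "reward": rng.random()}
        for _ in range(n)
    ]


def central_update(policy_version, rollouts):
    """Hypothetical stand-in for the single, centralised training step."""
    mean_reward = sum(r["reward"] for r in rollouts) / len(rollouts)
    # A real trainer would compute gradients here; we only bump the version.
    return policy_version + 1, mean_reward


def rl_step(policy_version, num_clusters=4, rollouts_per_cluster=8):
    # Fan the inference workload out to the clusters in parallel.
    with ThreadPoolExecutor(max_workers=num_clusters) as pool:
        futures = [
            pool.submit(generate_rollouts, c, policy_version, rollouts_per_cluster, seed=c)
            for c in range(num_clusters)
        ]
        batches = [f.result() for f in futures]
    # Gather every cluster's samples at the one trainer.
    rollouts = [r for batch in batches for r in batch]
    new_version, mean_reward = central_update(policy_version, rollouts)
    return new_version, mean_reward, rollouts


if __name__ == "__main__":
    version, mean_reward, rollouts = rl_step(policy_version=0)
    print(version, len(rollouts), round(mean_reward, 3))
```

The only point of the sketch is the shape of the loop: sample generation is embarrassingly parallel across clusters, while the update consumes the gathered batch in one place.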

8h233