Ok if I really follow DeepSeek recipes synth gen at scale is literally a pretrain job.
Alexander Doria@Dorialexander
Optimized large model inference is really just pretraining with extra steps.
1:26 PM · Jun 23, 2026 · 3.3K Views
Ok if I really follow DeepSeek recipes synth gen at scale is literally a pretrain job.
Optimized large model inference is really just pretraining with extra steps.
No Digg Deeper questions have been answered for this story yet.
I also get why they finally dropped the 7B for the speciale recipe. Sparse MoE economics means that large model inference is the better trade-off.
Ok if I really follow DeepSeek recipes synth gen at scale is literally a pretrain job.