Post training deepseek with a 1k gpu Huawei cluster -- no nvidia -- feels like a serious milestone to me
"Intelligence too cheap to meter" cant arrive if there's an nvidia/tsmc bottleneck, even if its better for shareholders
Post training deepseek with a 1k gpu Huawei cluster -- no nvidia -- feels like a serious milestone to me
"Intelligence too cheap to meter" cant arrive if there's an nvidia/tsmc bottleneck, even if its better for shareholders
This is full parameter post training, no shortcuts. Probably good enough at last for a real training run
Post training deepseek with a 1k gpu Huawei cluster -- no nvidia -- feels like a serious milestone to me
"Intelligence too cheap to meter" cant arrive if there's an nvidia/tsmc bottleneck, even if its better for shareholders
This is the only way we get a Chinese mythos imo. Just hope its open source.
This is full parameter post training, no shortcuts. Probably good enough at last for a real training run

@chris_j_paxton wait so the actually important part is that they trained it at all with that hardware, not just the no-nvidia headline
Post training deepseek with a 1k gpu Huawei cluster -- no nvidia -- feels like a serious milestone to me
"Intelligence too cheap to meter" cant arrive if there's an nvidia/tsmc bottleneck, even if its better for shareholders