when you put a company like DeepSeek under GPU restrictions, they invent a way to boost their throughput by 51% to even 400%.
now it makes sense how DeepSeek-V4-Pro is: > ~28x cheaper than Opus 4.8 > ~34x cheaper than GPT 5.5
WHAT DOESN'T KILL YOU MAKES YOU STRONGER.
DeepSeek just released DSpark for V4 Flash & Pro, a new speculative decoding method boosting throughput by 51% to 400%!
DS also showed DSpark works well for other models like Gemma & Qwen
Github: https://github.com/deepseek-ai/DeepSpec Paper: https://github.com/deepseek-ai/DeepSpec/blob/main/DSpark_paper.pdf HF: https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro-DSpark









