It's already reasonably well established that it preserves a lot of general capability, so it'd be interesting to test this on *knowledge* against gpt-oss-120B, since they're actually close in on-disk size.
Antirez (the person who built Redis) is now publishing quantized versions of DeepSeek V4 on Hugging Face. The technique he’s using is worth understanding even if the model is too big for your GPU.
quick background: quantization is how you shrink a model to fit on smaller