Nvidia-quantized Gemma-4-26B-A4B MoE runs 16 parallel streams at 300 tokens per second on a single DGX Spark · Digg