Google DeepMind releases quantized Gemma 4 models, using quantization-aware training to compress the smallest variant to 0.84 GB · Digg