First, amazing to see MLX optimization launch! second, Prince does a great job highlighting the top model features. You get SO much with this model.
🚀 Gemma 4 12B is here!
We partnered with @GoogleDeepMind to bring and optimize their new dense and unifed multimodal model for Apple Silicon.
◈ 12B dense · 256K context ◈ Thinking mode (built-in reasoning) ◈ Vision: dynamic res, OCR, UI + charts ◈ Native audio: ASR + speech translation ◈ Function calling for agents ◈ Text + image + audio, interleaved
Runs local. Get started now ⚡
> uv pip install -U mlx-vlm
https://github.com/Blaizzy/mlx-vlm