/AI9h ago

MLX-VLM Optimizes Gemma 4 12B For Local Apple Silicon Inference

429187559883K

Quote posts

Reposts

#1188

Original post

Olivier Bachem#1188

Dmitry Lyalin@LyalinDotCom

First, amazing to see MLX optimization launch! second, Prince does a great job highlighting the top model features. You get SO much with this model.

Prince Canuma@Prince_Canuma

🚀 Gemma 4 12B is here!

We partnered with @GoogleDeepMind to bring and optimize their new dense and unifed multimodal model for Apple Silicon.

◈ 12B dense · 256K context ◈ Thinking mode (built-in reasoning) ◈ Vision: dynamic res, OCR, UI + charts ◈ Native audio: ASR + speech translation ◈ Function calling for agents ◈ Text + image + audio, interleaved

Runs local. Get started now ⚡

> uv pip install -U mlx-vlm

https://github.com/Blaizzy/mlx-vlm

11:44 AM · Jun 3, 2026 · 3.1K Views

/AI9h ago

MLX-VLM Optimizes Gemma 4 12B For Local Apple Silicon Inference

--0--

Quote posts

Reposts

#1188

Original post

Olivier Bachem#1188

Dmitry Lyalin@LyalinDotCom

First, amazing to see MLX optimization launch! second, Prince does a great job highlighting the top model features. You get SO much with this model.

Prince Canuma@Prince_Canuma

🚀 Gemma 4 12B is here!

We partnered with @GoogleDeepMind to bring and optimize their new dense and unifed multimodal model for Apple Silicon.

Runs local. Get started now ⚡

> uv pip install -U mlx-vlm

https://github.com/Blaizzy/mlx-vlm

11:44 AM · Jun 3, 2026 · 3.1K Views

Sentiment

Many users praised the MLX optimization of Gemma 4 12B for enabling efficient local multimodal inference on Apple Silicon and expressed thanks plus excitement to try it, while a few called the model unreliable or underperforming in benches.

Pos

90.6%

Neg

9.4%

28 comments with sentiment.

Cluster Engagement

Sentiment

Sentiment building, check back later.

Cluster Engagement

Views

Comments

Reposts

Bookmarks

Expand data

Posts from X

Most Activity

RETWEETS47

Prince Canuma@Prince_Canuma

🚀 Gemma 4 12B is here!

We partnered with @GoogleDeepMind to bring and optimize their new dense and unifed multimodal model for Apple Silicon.

Runs local. Get started now ⚡

> uv pip install -U mlx-vlm

https://github.com/Blaizzy/mlx-vlm

Google Gemma@googlegemma

Meet Gemma 4 12B!

A unified, encoder-free multimodal model designed to bring high-performance intelligence directly to your laptop, and released under an Apache 2.0 license.

Bridging the gap between edge efficiency and advanced reasoning. Here is what’s new with Gemma 4 12B: 👇

10h83.7K929609