Users are excited about Gemma 4 12B achieving a 1.72× speedup on M3 Ultra via MLX-VLM because it shows effortless high performance on Apple Silicon, while some worry about model size and lower speeds on devices with less unified memory.
7 comments with sentiment.