AI · 23h ago

Gemma 4 31B Beats Qwen 3.6 27B In Reliability And Speed

Original post · Olivier Bachem#1202
Behnam @OrganicGPT

Qwen 3.6 27B is great but I have found Gemma 4 31B much more reliable. It doesn't overthink, uses the right tools only when needed, and can run faster thanks to its superior MTP design. A larger model running faster than a smaller one, that's crazy!!

6:29 PM · Jun 6, 2026 · 43.5K Views
Sentiment

Positive commenters praise Gemma 4 31B for better structured outputs and reliability than Qwen 3.6, while negative commenters report preferring Qwen or finding Gemma inferior on their own benchmarks.

Positive 50.0% · Negative 50.0%
9 comments with sentiment.
Cluster Engagement
Posts from X
Most Activity
Views 2.3K · Bookmarks 21 · Retweets 1
Latent Node @latent_node

@OrganicGPT If you are on Mac you can try using the optiq quants, they are much more accurate and faster usually - http://mlx-optiq.com

21h · Views 2.3K · Likes 8 · Bookmarks 21
Likes 22
Youssof Altoukhi @Youssofal_

@OrganicGPT Interesting how perspectives differ, for pretty much the same reasons I prefer Qwen 3.6 over Gemma.

19h · Views 2K · Likes 22 · Bookmarks 1
Replies 3
Viz @reachmeviz

@OrganicGPT Mine is the total opposite. Gemma models (both QAT and normal versions) keep entering a generation loop. Even if I turn off thinking in LMS, the “wait, I need to do X… Wait, I need to do Y” output constantly pollutes the context. Qwen 3.6, both 35B and 27B, is still gold to me!

10h · Views 387 · Likes 2
Behnam @OrganicGPT

@Youssofal_ I wonder why Qwen is more resilient to quantization given that both are dense. Could it have to do with DeltaNet?

19h · Views 650 · Likes 5 · Bookmarks 3
BlackwellBoy @SlimTradeyBaby

@OrganicGPT I’m the complete opposite. Gemma has been trash for me on every benchmark in comparison to Qwen.

17h · Views 949 · Likes 11
Youssof Altoukhi @Youssofal_

@OrganicGPT I think that’s the case, @bnjmn_marie posts great benchmarks on this.

18h · Views 581 · Likes 8
Timur Yessenov @Timur_Yessenov

@OrganicGPT My bias: for agent runs I care less about benchmark rank and more about “does it stop thinking when a simple tool call is enough.” Google’s QAT/MTP push is interesting because it attacks the boring local constraint: memory + latency. If Gemma is calmer there, that’s a real edge.

13h · Views 747 · Likes 2 · Bookmarks 1
Youssof Altoukhi @Youssofal_

@OrganicGPT Qwen is way more resilient to quantisation than Gemma.

I usually run INT8 or hybrid INT4-BF16

19h · Views 640 · Likes 9
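
For context on the formats named in the post above (INT8, hybrid INT4-BF16) and the BF16 models discussed elsewhere in the thread, here is a rough sketch of weight-only memory arithmetic. It assumes the nominal parameter counts in the model names (27B and 31B) and counts only bytes per weight; KV cache, activations, and quantization metadata such as scales are ignored. The function names and the `int4_fraction` knob are hypothetical, chosen for illustration, and do not describe any particular quantization recipe.

```python
# Rough weight-only memory estimate. Ignores KV cache, activations,
# and quantization metadata (scales, zero points), so real files are larger.
BYTES_PER_PARAM = {"bf16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_gib(params_billion: float, fmt: str) -> float:
    """Approximate weight storage in GiB for one numeric format."""
    return params_billion * 1e9 * BYTES_PER_PARAM[fmt] / 2**30


def hybrid_weight_gib(params_billion: float, int4_fraction: float) -> float:
    """Mix of INT4 and BF16 weights; int4_fraction is a hypothetical knob."""
    if not 0.0 <= int4_fraction <= 1.0:
        raise ValueError("int4_fraction must be between 0 and 1")
    return (int4_fraction * weight_gib(params_billion, "int4")
            + (1.0 - int4_fraction) * weight_gib(params_billion, "bf16"))


if __name__ == "__main__":
    # Nominal sizes taken from the model names; actual counts may differ.
    for name, size in (("Qwen 3.6 27B", 27.0), ("Gemma 4 31B", 31.0)):
        for fmt in BYTES_PER_PARAM:
            print(f"{name} {fmt}: {weight_gib(size, fmt):.1f} GiB")
        print(f"{name} hybrid (90% INT4): {hybrid_weight_gib(size, 0.9):.1f} GiB")
```

This only bounds what fits in memory; it says nothing about how much accuracy either model loses at a given precision, which is the point under dispute in the thread.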
Behnam @OrganicGPT

@Timur_Yessenov That's exactly the advantage of Gemma! I use it for agentic tasks and not having to overthink before any simple task is really important. Qwen isn't that efficient in comparison.

11h · Views 495 · Likes 2
டான் @DonOfNothing

@OrganicGPT Using Gemma 4 with Ollama and Zed.

11h · Views 25 · Bookmarks 1
Raccoon 🦝 @raccoon_builds

@OrganicGPT @grok Help me explain to him: Qwen is a reasoning model; Gemma 4 is not a reasoning model but has a thinking mode.

12h · Views 816
Marko Tasic @mtasic85

Output is a question of taste for these two families of models. However, I have some simplified observations:

1. Dense vs MoE: Qwen 3.6 27B is dense, where all parameters activate. Gemma 4 31B is MoE, where not all parameters activate.

2. llama.cpp doesn't support MTP for Gemma 4 31B yet, or do you use some other engine for running models?

17h · Views 727
Behnam @OrganicGPT

@Youssofal_ Do you use the quantized versions? I'm talking about the full BF16 models; maybe the results change after quantization.

19h · Views 1.8K · Likes 3
Behnam @OrganicGPT

@latent_node Thanks, I'll try it on a Mac Studio. My post was about an RTX Pro 6000 with vLLM, though.

21h · Views 2K · Likes 5
Behnam @OrganicGPT

@raccoon_builds @grok both are reasoning models.

11h · Views 659 · Likes 1
Andrej Szontagh @ScatteraAI

@OrganicGPT I can't run Gemma 4 31B, but I can run some comparable MoE models like Qwen 3.6 35B A3B. It all comes down to your system constraints. I am VRAM constrained, but I have a decent amount of RAM. BOOM = usable. It's not mind-blowing speed, but very usable.

13h · Views 361 · Likes 1
Behnam @OrganicGPT

@SlimTradeyBaby You can't be serious, maybe you're not using the same 31B Gemma model?

10h · Views 319 · Likes 1
Versun @VersunPan

@OrganicGPT Sounds great, I'll give it a try on my Mac.

19h · Views 930
Behnam @OrganicGPT

@nonRealBrandon That's Qwen and DeepSeek V4 Pro

10h · Views 149 · Likes 1
BlackwellBoy @SlimTradeyBaby

@OrganicGPT Qwen 3.6 27B generally edges out or clearly beats Gemma 4 31B in head-to-heads, especially where it counts for a lot of users (coding/agentic stuff).

10h · Views 90 · Likes 1