Tech · 18d ago

Gemma 4 31B Beats Qwen 3.6 27B In Reliability And Speed

Original post

Behnam@OrganicGPT

Qwen 3.6 27B is great but I have found Gemma 4 31B much more reliable. It doesn't overthink, uses the right tools only when needed, and can run faster thanks to its superior MTP design. A larger model running faster than a smaller one, that's crazy!!

6:29 PM · Jun 6, 2026 · 49.6K Views

Sentiment

Positive users praise Gemma 4 31B for better structured outputs and reliability than Qwen 3.6 while negative users report preferring Qwen or finding Gemma inferior on their benchmarks.

Pos 50.0% · Neg 50.0% (9 comments with sentiment)

Posts from X

Most Activity

VIEWS 2.3K · BOOKMARKS 21 · RETWEETS 1

Latent Node@latent_node

@OrganicGPT If you are on Mac you can try using the optiq quants, they are much more accurate and faster usually - http://mlx-optiq.com

18d2.3K821

LIKES 22

Youssof Altoukhi@Youssofal_

@OrganicGPT Interesting how perspectives differ: I prefer Qwen 3.6 over Gemma for pretty much the same reasons.

18d2K221

REPLIES 3

Viz@reachmeviz

@OrganicGPT Mine is the total opposite. Gemma models (both QAT and normal versions) keep getting stuck in generation loops. Even if I turn off thinking in LMS, the “wait, I need to do X… Wait, I need to do Y” chatter constantly pollutes the context. Qwen 3.6, both 35B and 27B, is still gold to me!

18d3872

Behnam@OrganicGPT

@Youssofal_ I wonder why Qwen is more resilient to quantization given that both are dense. Could it have to do with DeltaNet?

18d65053

BlackwellBoy@SlimTradeyBaby

@OrganicGPT I’m the complete opposite. Gemma has been trash for me on every benchmark in comparison to Qwen.

18d94911

Youssof Altoukhi@Youssofal_

@OrganicGPT I think that’s the case, @bnjmn_marie posts great benchmarks on this.

18d5818

Timur Yessenov@Timur_Yessenov

@OrganicGPT My bias: for agent runs I care less about benchmark rank and more about “does it stop thinking when a simple tool call is enough.” Google’s QAT/MTP push is interesting because it attacks the boring local constraint: memory + latency. If Gemma is calmer there, that’s a real edge.

18d74721

Youssof Altoukhi@Youssofal_

@OrganicGPT Qwen is way more resilient to quantisation than Gemma.

I usually run INT8 or hybrid INT4-BF16

18d6409
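To put the INT8 and hybrid INT4-BF16 choices above in numbers, here is a rough back-of-envelope sketch of weight memory at different precisions. The parameter counts are read off the model names (27B and 31B); the 10% BF16 share in the hybrid case is a purely hypothetical figure for illustration, and the estimate ignores KV cache, activations, and quantization scales.

```python
def weight_memory_gb(num_params: float, bits_per_param: float) -> float:
    """Approximate weight storage in GB (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


def hybrid_bits(low_bits: float, high_bits: float, high_fraction: float) -> float:
    """Average bits per parameter when a fraction of weights stays at high precision."""
    return high_fraction * high_bits + (1 - high_fraction) * low_bits


if __name__ == "__main__":
    # Parameter counts assumed from the model names in the thread.
    models = {"Qwen 3.6 27B": 27e9, "Gemma 4 31B": 31e9}
    # The 10% BF16 share is a hypothetical value, not a real recipe.
    precisions = {
        "BF16": 16,
        "INT8": 8,
        "INT4": 4,
        "INT4-BF16 hybrid (10% BF16, assumed)": hybrid_bits(4, 16, 0.10),
    }
    for name, params in models.items():
        for label, bits in precisions.items():
            print(f"{name} @ {label}: {weight_memory_gb(params, bits):.1f} GB")
```

Under these assumptions, full BF16 weights alone come to about 54 GB for a 27B model and 62 GB for a 31B model, which is why quantized runs are common on smaller GPUs.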

Behnam@OrganicGPT

@Timur_Yessenov That's exactly the advantage of Gemma! I use it for agentic tasks and not having to overthink before any simple task is really important. Qwen isn't that efficient in comparison.

18d4952

டான்@DonOfNothing

@OrganicGPT Using Gemma 4 with Ollama and Zed.

18d251

Raccoon 🦝@raccoon_builds

@OrganicGPT @grok Help me explain to him: Qwen is a reasoning model; Gemma 4 is not a reasoning model but has a thinking mode.

18d816

Marko Tasic@mtasic85

Output is a question of taste for these two families of models. However, I have some simplified observations:

1. Dense vs MoE: Qwen 3.6 27B is dense, where all parameters activate; Gemma 4 31B is MoE, where not all parameters activate.

2. llama.cpp doesn't support MTP for Gemma 4 31B yet, unless you're using some other engine to run the models.

18d727

Behnam@OrganicGPT

@Youssofal_ Do you use the quantized versions? I'm talking about the full BF16 models; maybe the results change after quantization.

18d1.8K3

Behnam@OrganicGPT

@latent_node Thanks, I'll try it on a Mac Studio. My post was about an RTX Pro 6000 with vLLM, though.

18d2K5

Behnam@OrganicGPT

@raccoon_builds @grok both are reasoning models.

18d6591

Andrej Szontagh@ScatteraAI

@OrganicGPT I can't run Gemma 4 31B, but I can run some comparable MoE models like Qwen 3.6 35B A3B. It all comes down to your system constraints. I am VRAM constrained, but I have a decent amount of RAM. BOOM = usable. It's not mind-blowing speed, but very usable.

18d3611
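One way to see why a RAM-offloaded MoE can stay usable is a simple memory-bound estimate: each decoded token has to read roughly the active weights once, so bandwidth divided by bytes read gives a ceiling on tokens per second. The sketch below assumes "A3B" means about 3B active parameters, uses a hypothetical 60 GB/s system-RAM bandwidth, and ignores KV cache, routing overhead, and compute limits.

```python
def gb_read_per_token(active_params: float, bits_per_param: float) -> float:
    """Approximate GB of weights read per decoded token (1 GB = 1e9 bytes)."""
    return active_params * bits_per_param / 8 / 1e9


def max_tokens_per_second(bandwidth_gb_s: float, active_params: float,
                          bits_per_param: float) -> float:
    """Upper bound on decode speed if weight reads are the only bottleneck."""
    return bandwidth_gb_s / gb_read_per_token(active_params, bits_per_param)


if __name__ == "__main__":
    ram_bandwidth = 60.0  # GB/s, hypothetical value for illustration only
    # Assumed active parameter counts: ~3B for a 35B A3B MoE, all 31B for a dense model.
    for name, active in {"35B A3B MoE (assumed 3B active)": 3e9,
                         "31B dense": 31e9}.items():
        ceiling = max_tokens_per_second(ram_bandwidth, active, 16)
        print(f"{name}: <= {ceiling:.1f} tokens/s at BF16 from RAM")
```

Under these assumptions the MoE reads about 6 GB per token against 62 GB for the dense model, so its speed ceiling from the same memory is roughly ten times higher.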

Behnam@OrganicGPT

@SlimTradeyBaby You can't be serious, maybe you're not using the same 31B Gemma model?

18d3191

Versun@VersunPan

@OrganicGPT Sounds great, I'll give it a try on Mac.

18d930

Behnam@OrganicGPT

@nonRealBrandon That's Qwen and DeepSeek V4 Pro

18d1491

BlackwellBoy@SlimTradeyBaby

@OrganicGPT Qwen 3.6 27B generally edges out or clearly beats Gemma 4 31B in head-to-heads, especially where it counts for a lot of users (coding/agentic stuff).

18d901