And it’s only 40B active / 744B total params…
still can't believe how good glm 5.2 is
GLM-5.2 activates 40 billion of 744 billion parameters.
And it’s only 40B active / 744B total params…
still can't believe how good glm 5.2 is
Users praise Zai_org for consistently crushing it with GLM-5.2's strong results at efficient 40B active parameters.
No Digg Deeper questions have been answered for this story yet.
@Thom_Wolf smollm4 40B / 744B wen? 🥺
And it’s only 40B active / 744B total params…

@Thom_Wolf What are the inactive ones doing?

@Thom_Wolf @Zai_org is absolutely onto something. I’ve thought that since 4.5 air - they just keep crushing it.

@Thom_Wolf we made glm 5.2 pretty fast btw

@Thom_Wolf being surprised at those numbers feels like we havent hit ceiling yet
somewhere a 100B just dropped and nobody blinked

@samuelekpe @Thom_Wolf

@Thom_Wolf

@Thom_Wolf smaller active params usually means more distilled optimization
kinda wild if it actually holds up