Pleias CTO Pierre-Carl Langlais argues GLM 5.2 includes architectural updates alongside its 128K IndexShare mid-training, which Elie Bakouch disputes

The training recipe also features extensive reinforcement learning.

Original post

Alexander Doria@Dorialexander

Has anyone done any speculation on the training recipe of GLM 5.2? Beyond extensive RL, we know it's (at least?) a new midtrain ("GLM-5.2 is trained with IndexShare from mid-training with 128K sequence length") with arch changes.

9:30 AM · Jun 21, 2026 · 27K Views

Sentiment

Users welcome speculated GLM-5.2 mid-training changes as better news for the open ecosystem because they could shortcut expensive data-building capabilities through synthetic generation.

Pos: 100.0% · Neg: 0.0% (1 comment with sentiment)

Posts from X

Alexander Doria@Dorialexander

actually, along with other closed labs signals, doesn't seem great news for custom rl env sellers.

Alexander Doria@Dorialexander

@yacineMTB (likely through the OPD recipe, since they mention it, "efficiently merging more than ten expert models into the final model")

Alexander Doria@Dorialexander

Based on the bouba shape, my guess would be hard synth/rl env scaling with recursive generative design+eval.

elie@eliebakouch

@Dorialexander (except index sharing ok but they already published this paper and just didn't need it for smaller context)

elie@eliebakouch

@Dorialexander there is no arch changes?

Alexander Doria@Dorialexander

@yacineMTB yeah and diversity/combinations.

kache@yacineMTB

@Dorialexander when you say RL env scaling, you mean total volume of RL envs right?

Alexander Doria@Dorialexander

and, conversely, much better news for the open ecosystem that can maybe shortcut a billion-dollars data building capability by generating it all. though you'll still need hard skills.

ChuhaiDev@ChuhaiDev

@Dorialexander I think they explained some of the stuff in their paper on training 5.0

Alexander Doria@Dorialexander

@ChuhaiDev ok you were right.

elie@eliebakouch

@Dorialexander ok i was confused bc you said

> we know it's a new midtrain with arch changes

Alexander Doria@Dorialexander

@eliebakouch None that I have seen. Param count identical.

Alexander Doria@Dorialexander

@ChuhaiDev yeah but doesn't really explain the sudden take-off.

infrecursion@infrecursion1

@Dorialexander Anthropic will let us know soon through another blog post.

Suresh@_Suresh2

@Dorialexander indexshare at 128K in midtrain points to multi-document packing so the model picks up cross-doc signals early.

Alexander Doria@Dorialexander

@eliebakouch ah yeah meant light arch changes.

umumu@umi33563

@Dorialexander @yacineMTB MAI combined 3 experts into final model via SFT (which I was like, no way this would work? with my limited knowledge I expected OPD to be more robust)

I'm wondering what were expert objectives/curriculums for 5.2. Lot of expert models

Luke Richey@lukebrichey

@Dorialexander @ChuhaiDev Lol

Erika S@E_FutureFan

@Dorialexander I keep wondering how much comes from the midtraining vs the RL phase. Without ablations it's hard to know what's actually moving the needle.
