/Tech1h ago

ICML position paper argues AI research relies too heavily on post-training patches instead of studying training dynamics

Yoav Goldberg endorsed the paper's 'fix it in post' analogy.

132793711528.3K
Original post
Stella Biderman@BlancheMinerva

In film, "we'll fix it in post" is what you say when something went wrong on set and you don't want to redo it. AI research has made it our entire methodology: train the model, then patch whatever comes out. Our new ICML oral argues this can't be the basis of a science of AI. 🧵

12:08 PM · Jun 10, 2026 · 26.8K Views
Sentiment

Many users praised the ICML paper's argument for studying AI training dynamics rather than post-hoc fixes, citing strong agreement with the framing and excitement to read and share the work.

Pos
100.0%
Neg
0.0%
5 comments with sentiment.
Cluster Engagement
Posts from X
Most Activity
Most Activity
VIEWS1.9KBOOKMARKS7LIKES42RETWEETS7
Stella Biderman@BlancheMinerva

Read the full paper: https://arxiv.org/abs/2606.06533 or come listen to our oral @icmlconf!

Huge thanks to my co-authors @aflah02101 @niloofar_mire @linguist_cat @FazlBarez @nsaphra

Stay tuned for a related workshop (hopefully) at NeurIPS too!

20hViews 1.9KLikes 42Bookmarks 7
REPLIES2
Stella Biderman@BlancheMinerva

We ground discussion in the history and philosophy of science. What did it take for other fields to move from cataloging phenomena to predicting and controlling them? AI can learn from that playbook.

20hViews 690Likes 20
Stella Biderman@BlancheMinerva

Post hoc analysis can certainly be useful, especially if you’re primarily concerned with the behavior of a specific deployed model. But looking at a static model will not tell you why the model developed a behavior. The real causal story must go back to the training process.

20hViews 1KLikes 24Bookmarks 1
Stella Biderman@BlancheMinerva

Part of why post hoc analysis dominates: it's the only thing most researchers CAN do. Almost no one releases intermediate checkpoints or training data. we built MultiBERT and Pythia to set a better standard, and it's been great to see work like OLMo and Marin follow our lead.

20hViews 849Likes 16Bookmarks 1
Stella Biderman@BlancheMinerva

Also, appolgies to @learning_mech and the "There Will Be a Scientific Theory of Deep Learning" team for not engaging with the contents of your paper. I believe I learned about it on the same day we got the ICML acceptance notifications.

18hViews 1.6KLikes 8Bookmarks 2
Stella Biderman@BlancheMinerva

Models are not static objects. They're snapshots of time-evolving processes shaped by data, objectives, architectures, and optimization. But most research treats them as fixed artifacts, analyzing behaviors after training instead of asking why they emerged.

20hViews 1.2KLikes 24
Stella Biderman@BlancheMinerva

A test for progress: a science of AI should support progressively stronger forms of understanding: 1. Predict outcomes from early training signals 2. Intervene to correct trajectories on undesirable paths 3. Design training procedures that reliably produce desired properties

20hViews 623Likes 18
Stella Biderman@BlancheMinerva

A common issue with position papers is that they leave the reader wondering “okay, but what should I actually do”? To address this we provide open problems on a wide variety of topics throughout to illustrate our perspectives and guide future research

20hViews 606Likes 18
Jeremy Howard@jeremyphoward

@BlancheMinerva I love this framing :)

19hViews 1.8KLikes 11
Stella Biderman@BlancheMinerva

From what I've skimmed I think we're in agreement about a lot of things, but I'm excited to find time to read it closely :)

18hViews 1.3KLikes 3Bookmarks 1
Guilherme O'Tina@guilhermeotina

honestly the framing underrates how much we learned from fixing in post. scaling laws, grokking, induction heads, even rlhf all came from post-hoc analysis of already-trained models, not from planned instrumentation during training runs. you can only measure what you thought to look for, and the good surprises are usually the ones you didnt plan

19hViews 47
Yuval Pinter@yuvalpi

@yoavartzi why not from before pre-training, i.e. from the token vocab?

8hViews 39
Edward Raff@EdwardRaffML

@BlancheMinerva Does latex handle BCE bib entires well?

16hViews 21
Yoav Artzi@yoavartzi

+1 and more: we should train models to be the way we want them to be -- as in from pre-training, not with post-hoc brittle patching. For this, we must scale down, and allow speculative research trajectories to flourish, rather than squash them before they get a chance

Stella Biderman@BlancheMinerva

In film, "we'll fix it in post" is what you say when something went wrong on set and you don't want to redo it. AI research has made it our entire methodology: train the model, then patch whatever comes out. Our new ICML oral argues this can't be the basis of a science of AI. 🧵

11hViews 1.7KLikes 19Bookmarks 2
Turing@turingcom

@BlancheMinerva @icmlconf @Aflah02101 @niloofar_mire @linguist_cat @FazlBarez @nsaphra Going to share this with the team!👏

19hViews 177Likes 1

@BlancheMinerva Le "post", c'est ce qu'on appelle une commission d'enquête. Les résultats sont excellents, on a même trouvé des fonds secrets dans le dataset. 🇫🇷

16hViews 123
Stella Biderman@BlancheMinerva

@guilhermeotina Yes, if we said that we would be very silly. But that's not what we're talking about. Scaling laws, grokking, and induction heads are some of the best examples of the kind of work we are advocating for.

19hViews 24Likes 1
Maya N@mayasolos

@BlancheMinerva Very here for this. If we keep doing 'fix it in post,' we’re doing damage control, not building a science of AI.

12hViews 33
Yoav Artzi@yoavartzi

@yuvalpi Fair enough. I guess I see tokenization as part of the pre-training process to start with. But, good point, given the default is just picking up on 1-2 pre-trained tokenizers regardless of your data

1hViews 5Likes 1
Stella Biderman@BlancheMinerva

@EdwardRaffML It doesn’t really need to… None of the bibliography formats that I use sort by date as a numeric or anything like that.

16hViews 14