
University of Chicago's Ari Holtzman seeks non-contrived examples of LLMs performing worse on a distribution after being fine-tuned on data drawn from it

Stella Biderman suggested one candidate: showing the model evidence that contradicts one of its beliefs.

Original post
Ari Holtzman (@universeinanegg)

For a given LLM there must be certain data drawn from distribution D, such that if you finetune on them the LLM performs worse on D. It misgeneralizes, due to its priors, as we all do. Are there any interesting cases of this that don't feel totally adversarial and artificial?

9:58 PM · Jun 1, 2026 · 717 Views
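
The existence half of the claim can be illustrated with a deliberately artificial toy, which is exactly the kind of case the post hopes to move beyond. The sketch below is an illustration under stated assumptions, not anything from the thread: the distribution `SUPPORT`, the one-parameter model `y = w * x`, and the helpers `expected_loss` and `finetune` are all hypothetical. It fine-tunes on a single rare but legitimate draw from D and compares the exact expected loss on D before and after.

```python
# Hypothetical toy distribution D, listed as (x, y, probability) triples.
# Most of the mass follows y = 2x; a rare but valid outcome has y = 2x + 3.
SUPPORT = [
    (1.0, 2.0, 0.45),
    (2.0, 4.0, 0.45),
    (1.0, 5.0, 0.05),
    (2.0, 7.0, 0.05),
]


def expected_loss(w):
    """Exact expected squared error of the model y = w * x under D."""
    return sum(p * (w * x - y) ** 2 for x, y, p in SUPPORT)


def finetune(w, samples, lr=0.05, steps=100):
    """Plain gradient descent on mean squared error over the given samples."""
    for _ in range(steps):
        grad = sum(2 * (w * x - y) * x for x, y in samples) / len(samples)
        w -= lr * grad
    return w


# Assumed starting point: the model already believes y = 2x.
w_before = 2.0

# Fine-tune on one genuine draw from D (it has probability 0.05 under SUPPORT).
rare_sample = [(1.0, 5.0)]
w_after = finetune(w_before, rare_sample)

loss_before = expected_loss(w_before)
loss_after = expected_loss(w_after)

print(f"w: {w_before:.3f} -> {w_after:.3f}")
print(f"expected loss on D: {loss_before:.3f} -> {loss_after:.3f}")
```

Here the damage comes from overfitting one unrepresentative draw rather than from the model's priors, so it does not answer the question. It only shows why some such data must exist.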
Posts from X
Stella Biderman (@BlancheMinerva)

@universeinanegg Showing the model contrary evidence to one of its beliefs?

9h · 408 Views · 0 Likes · 0 Bookmarks