/AI9h ago

University of Chicago's Ari Holtzman seeks real-world examples of LLMs performing worse after fine-tuning on their own distribution

Stella Biderman suggested conflicting evidence causes this misgeneralization.

10011.1K

Original posts

#511

Comments

#208

Original post

Ari Holtzman@universeinanegg#511inAI

For a given LLM there must be certain data drawn from distribution D, such that if you finetune on them the LLM performs worse on D. It misgeneralizes, due to its priors, as we all do. Are there any interesting cases of this that don't feel totally adversarial and artificial?

9:58 PM · Jun 1, 2026 · 717 Views

/AI9h ago

University of Chicago's Ari Holtzman seeks real-world examples of LLMs performing worse after fine-tuning on their own distribution

Stella Biderman suggested conflicting evidence causes this misgeneralization.

--0--

Original posts

#511

Comments

#208

Original post

Ari Holtzman@universeinanegg#511inAI

9:58 PM · Jun 1, 2026 · 717 Views

Sentiment

Sentiment unavailable for this story.

Cluster Engagement

Sentiment

Sentiment unavailable for this story.

Cluster Engagement

Views

Comments

Reposts

Bookmarks

Expand data

Posts from X

Most Activity

Stella Biderman@BlancheMinerva

@universeinanegg Showing the model contrary evidence to one of its beliefs?

Ari Holtzman@universeinanegg

9h40800