/AI4h ago

Jing Huang and Chris Potts find larger language models outperform smaller ones by reducing task interference and neuron competition

Smaller models suffer from severe task interference during training.

--0--
Quote posts
Reposts
Original post
Tomek Korbak@tomekkorbak#1178inAI

such a good twitter thread!

Christopher Potts@ChrisGPotts

We take for granted that larger models are better than smaller ones, but why is this so? Our new paper, led by Jing Huang and @EkdeepL, traces this to a data-induced competition for resources (neurons), using formal analysis, idealized tasks, and real pretraining.

10:28 AM · Jun 3, 2026 · 2.2K Views
Sentiment
Sentiment building, check back later.
Cluster Engagement
-
Views
-
Comments
-
Reposts
-
Bookmarks
Expand data
Posts from X
Most Activity
Most ActivityTimeline
RETWEETS114
Christopher Potts@ChrisGPotts

We take for granted that larger models are better than smaller ones, but why is this so? Our new paper, led by Jing Huang and @EkdeepL, traces this to a data-induced competition for resources (neurons), using formal analysis, idealized tasks, and real pretraining.

2dViews 102KLikes 780Bookmarks 691