New Paper Predicts LLM Compositional Errors From Feature Geometry

Original post

Our ICML mechinterp workshop paper demonstrates how feature geometry can lead to model failures, and analyzing that geometry can help us to efficiently build adversarial test sets based on concept combinations.

Naomi Saphra@nsaphra

We don’t always know what problems are hard for LLMs. So devs evaluate on tasks HUMANS find hard or on broad benchmarks. What if we could instead anticipate which scenarios a model will fail on—all without evaluating specific input examples?

🧵NEW PAPER by @jenniferlumeng &al

2:59 PM · Jun 28, 2026 · 204 Views

Posts from X

Naomi Saphra@nsaphra

Our ICML HiLD workshop paper shows that the reason why bigger models learn more complex tasks is because they are able to saturate the gradients for easier tasks; different tasks are competing for the same parameters and gradient mass.

Christopher Potts@ChrisGPotts

We take for granted that larger models are better than smaller ones, but why is this so? Our new paper, led by Jing Huang and @EkdeepL, traces this to a data-induced competition for resources (neurons), using formal analysis, idealized tasks, and real pretraining.
