/Tech3h ago

Researchers Showcase Six Papers at ICML 2026 in Seoul

3330131.7K

Original post

If you're heading to #ICML2026, you should definitely go meet my collaborators and check out our work! Highlight 🧵 for 2 main track papers, 2 position papers, and 2 workshop preprints!

2:59 PM · Jun 28, 2026 · 1.1K Views

Sentiment

Sentiment building, check back later.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

VIEWS359BOOKMARKS2LIKES1REPLIES1

Naomi Saphra@nsaphra

Our ICML paper on random factors in emergence reveals that whenever we observe ABRUPT "emergence" of a capability at scale, we're actually sampling from a multimodal distribution which is GRADUALLY changing the probability of the capability with scale.

Naomi Saphra@nsaphra

Ever looked at LLM skill emergence and thought 70B parameters was a magic number? Our new paper shows sudden breakthroughs are samples from bimodal performance distributions across seeds. Observed accuracy jumps abruptly while the underlying accuracy DISTRIBUTION changes slowly!

3h35912

Naomi Saphra@nsaphra

Our ICML paper on verbalization techniques for interp reveals that current verbalizers (and verbalizer evaluations) fail to consider if an interpretation reflects the verbalizer's background knowledge rather than the subject model's processing.

3h911

Naomi Saphra@nsaphra

Our ICML mechinterp workshop paper demonstrates how feature geometry can lead to model failures, and analyzing that geometry can help us to efficiently build adversarial test sets based on concept combinations.

3h85

Naomi Saphra@nsaphra

Our ICML oral (Tuesday!) position paper argues that empirical science of DL and future model improvements both rely on understanding the training process, not just analyzing or manipulating a fully trained model.

3h251

Naomi Saphra@nsaphra

Our ICML position paper argues that interpretability not only SHOULD be actionable, but CAN be---just follow our framework for targeting actionable insights.

3h20

Naomi Saphra@nsaphra

Our ICML HiLD workshop paper shows that the reason why bigger models learn more complex tasks is because they are able to saturate the gradients for easier tasks; different tasks are competing for the same parameters and gradient mass.

3h91