If you're heading to #ICML2026, you should definitely go meet my collaborators and check out our work! Highlight 🧵 for 2 main track papers, 2 position papers, and 2 workshop preprints!
No Digg Deeper questions have been answered for this story yet.
Most Activity
Our ICML paper on random factors in emergence reveals that whenever we observe ABRUPT "emergence" of a capability at scale, we're actually sampling from a multimodal distribution which is GRADUALLY changing the probability of the capability with scale.
Ever looked at LLM skill emergence and thought 70B parameters was a magic number? Our new paper shows sudden breakthroughs are samples from bimodal performance distributions across seeds. Observed accuracy jumps abruptly while the underlying accuracy DISTRIBUTION changes slowly!

Our ICML paper on verbalization techniques for interp reveals that current verbalizers (and verbalizer evaluations) fail to consider if an interpretation reflects the verbalizer's background knowledge rather than the subject model's processing.

Our ICML mechinterp workshop paper demonstrates how feature geometry can lead to model failures, and analyzing that geometry can help us to efficiently build adversarial test sets based on concept combinations.

Our ICML oral (Tuesday!) position paper argues that empirical science of DL and future model improvements both rely on understanding the training process, not just analyzing or manipulating a fully trained model.

Our ICML position paper argues that interpretability not only SHOULD be actionable, but CAN be---just follow our framework for targeting actionable insights.

Our ICML HiLD workshop paper shows that the reason why bigger models learn more complex tasks is because they are able to saturate the gradients for easier tasks; different tasks are competing for the same parameters and gradient mass.