There is still a lot to learn about how the microscopic organization of large AI models - at the level of individual neurons - changes quantitatively with scale and across diverse training settings. @_AmilDravid uncovers new features of universal neurons connecting directly to semantic structure and scaling. I'm excited for the directions emerging from Amil's work!
Scaling laws describe how loss changes with scale. Do neurons inside models change predictably too?
We study vision and language models up to 30B params and find systematic scaling in neuron universality, specialization, and selectivity.
Paper+code: https://avdravid.github.io/rosetta-neuron-scaling/
1/n