/Tech1d ago

AI Interpretability Researchers Explore Describing Vision Neurons With Words

131824510373.1K

Original post unavailable.

Sentiment

Users are thanking the Enigma Project team and collaborators for their research using vision-language models to describe vision neurons with words.

Pos

100.0%

Neg

0.0%

1 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

Vedang Lad@vedanglad

1/ We study “digital twins” of macaque V1/V4 -- vision models trained to predict the activity of biological neurons in the primate visual cortex -- and use their outputs to study how the brain structures the world.

1d843

BOOKMARKS1

Vedang Lad@vedanglad

4/ But is the hypothesis right? Generate new images from it → test them on the twin. In V1, this rediscovers the known selectivity for oriented gratings and in V4 the words drive 96.1% of neurons driven above the 95th percentile of natural responses.

1d4831

LIKES4

Vedang Lad@vedanglad

Many thanks to Nikos Karantzas, @kfrankelab, @AToliasLab @naturecomputes @SuryaGanguli @TamarRottShaham and the rest of the team at the Enigma Project!

1d644

RETWEETS2REPLIES1

Vedang Lad@vedanglad

7/ Models promise agentic discovery, but rarely define how to uncover and verify findings. Here, researchers work in tandem with models to deepen understanding. Explore more on our website + read the paper 👇

Website: https://enigma-brain.github.io/letting-the-neural-code-speak/ Paper: https://arxiv.org/pdf/2605.12485

1d752

Vedang Lad@vedanglad

5/ Why does this work? Vision, language, and neural activity partially share a common geometry!

1d494

Vedang Lad@vedanglad

2/ Feature selectivity is visually interpretable -- but how can we reliably scale this? We "translate" images to text with a dense caption. While VLMs get stuck on semantics, language provides a human-interpretable discretization of visual input that a powerful LLM can reason over.

1d653

Vedang Lad@vedanglad

6/ This UMAP of V4 neurons, annotated with hypothesis keywords, shows smooth semantic transitions across the population.

1d593

Vedang Lad@vedanglad

3/ Screen 1M+ images on the twin → take each neuron’s most- and least-activating stimuli → distill their captions into one hypothesis.

1d443