Tech · 22d ago

Preprint Shows ESM-2 And ESM-3 Capture Mostly The Same Protein Biology

Sentiment

Commenters praised the preprint, which shows that the ESM-2 and ESM-3 protein models capture similar biology, calling the finding cool and the summary great.

Pos: 100.0%

Neg: 0.0%

4 comments with sentiment.

Posts from X

Most Activity

Views 11.3K · Bookmarks 20 · Likes 47 · Retweets 7 · Replies 2

🧬Jacob L Steenwyk @jlsteenwyk

Surprisingly, one attention head (L0H7) does most of the work for structure token reasoning

Ablating it alone changes 40% of secondary structure predictions. 10 random layer-0 heads: <17%.

Also happy to hear that @KestenGal observed a similarly disproportionate contribution of L0H7 in their article on protein repeats: https://arxiv.org/abs/2602.23179
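
A minimal sketch of how an ablation effect like the one above could be scored: compare per-residue secondary structure calls before and after knocking out a head and report the fraction that changed. The label strings and the function name are hypothetical illustrations, not data or code from the preprint.

```python
def fraction_changed(baseline, ablated):
    """Fraction of positions whose predicted label differs after ablation."""
    if len(baseline) != len(ablated):
        raise ValueError("prediction sequences must have equal length")
    if len(baseline) == 0:
        raise ValueError("need at least one prediction")
    changed = sum(b != a for b, a in zip(baseline, ablated))
    return changed / len(baseline)


# Hypothetical per-residue calls (H = helix, E = strand, C = coil).
baseline = "HHHHHCCEEE"   # predictions from the intact model (made up)
ablated = "HHHCCCCEEC"    # predictions with one head zeroed out (made up)

rate = fraction_changed(baseline, ablated)
print(f"{rate:.0%} of predictions changed")
```

Running the same comparison for a set of randomly chosen heads would give the kind of baseline the post contrasts against.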

🧬Jacob L Steenwyk @jlsteenwyk

Very excited to share this first exploration into mechanistic interpretability of these fascinating language models -- and looking forward to doing more of this type of work! https://www.biorxiv.org/content/10.64898/2026.05.12.724593v1

🧬Jacob L Steenwyk @jlsteenwyk

In summary, two architecturally distinct models that operate across different modalities converge on a shared biological vocabulary

Structure tokens sharpen that vocabulary rather than rewriting it

🧬Jacob L Steenwyk @jlsteenwyk

Steering vectors, attribution patching, and sparse feature circuits all mechanistically confirm that these features sit within the model's causal pathway
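
The post names attribution patching without describing the setup. As a rough illustration of the general technique only: attribution patching approximates the effect of swapping one activation from a "corrupt" run to its "clean" value by a first-order term, (clean - corrupt) times the gradient of the metric at the corrupt activations. The toy metric, the activation values, and the finite-difference gradient below are all assumptions for the sketch; in practice the gradient would come from backpropagation through the model.

```python
def numerical_grad(f, x, eps=1e-6):
    """Central-difference gradient of f at x (stand-in for backprop)."""
    grad = []
    for i in range(len(x)):
        up = list(x)
        up[i] += eps
        down = list(x)
        down[i] -= eps
        grad.append((f(up) - f(down)) / (2 * eps))
    return grad


def attribution_patching(f, clean, corrupt):
    """First-order estimate of each component's patching effect."""
    grad = numerical_grad(f, corrupt)
    return [(c - k) * g for c, k, g in zip(clean, corrupt, grad)]


def activation_patching(f, clean, corrupt):
    """Exact effect: patch one component at a time and re-evaluate f."""
    base = f(corrupt)
    effects = []
    for i in range(len(corrupt)):
        patched = list(corrupt)
        patched[i] = clean[i]
        effects.append(f(patched) - base)
    return effects


def toy_metric(acts):
    """Hypothetical downstream metric over three activations."""
    return 2.0 * acts[0] - 1.0 * acts[1] + 0.5 * acts[2] ** 2


clean = [1.0, 0.5, 1.2]     # made-up activations from a clean input
corrupt = [0.0, 0.5, 1.0]   # made-up activations from a corrupted input

approx = attribution_patching(toy_metric, clean, corrupt)
exact = activation_patching(toy_metric, clean, corrupt)
```

The two agree exactly on the linear terms and diverge slightly on the quadratic one, which is the trade-off the approximation makes for needing only one backward pass.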

🧬Jacob L Steenwyk @jlsteenwyk

This suggests that ESM-2 already infers structural regularities from sequence alone

Structure tokens don't rewrite the vocabulary. They sharpen representations that the sequence context has already approximated

🧬Jacob L Steenwyk @jlsteenwyk

Despite structure tokens representing an input modality unique to ESM-3, they don't create a new feature vocabulary

The 15.2% of features most activated by structure tokens are more convergent with sequence-only ESM-2 than invariant features (r=0.54 vs 0.45).
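
The post does not say how r is computed. Assuming it is a Pearson correlation between paired activation profiles (one feature from each model, evaluated over the same inputs), a minimal sketch looks like this. The activation values are invented for illustration.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical activations of one feature per model over the same residues.
esm2_feature = [0.1, 0.9, 0.4, 0.0, 0.7]
esm3_feature = [0.2, 0.8, 0.5, 0.1, 0.4]

r = pearson_r(esm2_feature, esm3_feature)
print(f"r = {r:.2f}")
```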

🧬Jacob L Steenwyk @jlsteenwyk

And the features both models agree on are the ones that matter.

Convergent features → AUROC 0.925 on functional site detection. Architecture-unique features → 0.661.

Shared = signal. Unique = mostly noise.
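
For readers unfamiliar with the metric: AUROC is the probability that a randomly chosen positive example (here, a functional site) scores higher than a randomly chosen negative one, so 0.5 is chance and 1.0 is perfect ranking. The sketch below computes it directly from that definition, counting ties as half. The labels and scores are made up and are not the preprint's data.

```python
def auroc(labels, scores):
    """AUROC as the fraction of (positive, negative) pairs ranked correctly."""
    if len(labels) != len(scores):
        raise ValueError("labels and scores must have equal length")
    pos = [s for l, s in zip(labels, scores) if l == 1]
    neg = [s for l, s in zip(labels, scores) if l == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5  # ties count as half a win
    return wins / (len(pos) * len(neg))


# Hypothetical data: 1 = functional site, 0 = background residue.
labels = [1, 1, 0, 0]
scores = [0.9, 0.4, 0.5, 0.1]  # made-up feature activations

print(auroc(labels, scores))
```

The pairwise loop is quadratic in the number of examples, which is fine for a sketch; a rank-based formulation scales better on large datasets.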

Chris Hayduk @ChrisHayduk

@jlsteenwyk @KestenGal Wow this is a very cool finding

Diego del Alamo @DdelAlamo

@jlsteenwyk Really nice work, and a great summarization of your finding!

🧬Jacob L Steenwyk @jlsteenwyk

@ChrisHayduk @KestenGal :)

🧬Jacob L Steenwyk @jlsteenwyk

@DdelAlamo Thank you! :D
