/AI9h ago

Google DeepMind's Arthur Conmy explains 'subliminal learning,' where LLMs pass behavioral traits through unrelated training data

The phenomenon occurs via steering vector distillation.

203103117747K

Original posts

Quote posts

#1191

Reposts

#782

Original post

Belinda Li#782

Camila Blank@camila_blank

Subliminal learning is when LLMs transmit traits (e.g. loving cats) through seemingly meaningless data. What’s going on?

We find a simple explanation: it's just steering vector distillation.

We explain which traits transfer and why subliminal learning fails across models.

9:47 AM · Jun 3, 2026 · 40.1K Views

/AI9h ago

Google DeepMind's Arthur Conmy explains 'subliminal learning,' where LLMs pass behavioral traits through unrelated training data

The phenomenon occurs via steering vector distillation.

--0--

Original posts

Quote posts

#1191

Reposts

#782

Original post

Belinda Li#782

Camila Blank@camila_blank

Subliminal learning is when LLMs transmit traits (e.g. loving cats) through seemingly meaningless data. What’s going on?

We find a simple explanation: it's just steering vector distillation.

We explain which traits transfer and why subliminal learning fails across models.

9:47 AM · Jun 3, 2026 · 40.1K Views

Sentiment

Many users praised researchers Camila and Agam for their work explaining subliminal learning in LLMs via steering vector distillation, highlighting its surprising results and helpful visualizations.

Pos

100.0%

Neg

0.0%

7 comments with sentiment.

Cluster Engagement

Sentiment

Sentiment building, check back later.

Cluster Engagement

Views

Comments

Reposts

Bookmarks

Expand data

Posts from X

Most Activity

VIEWS8.1KBOOKMARKS50LIKES99RETWEETS4REPLIES1

Arthur Conmy@ArthurConmy

In our new paper, we find an explanation of why subliminal learning occurs. As ever, steering vectors!

Camila Blank@camila_blank

Subliminal learning is when LLMs transmit traits (e.g. loving cats) through seemingly meaningless data. What’s going on?

We find a simple explanation: it's just steering vector distillation.

We explain which traits transfer and why subliminal learning fails across models.

8h8.1K9950