4h ago

A study tracing persona vectors in large language models finds that post-training amplifies existing pretraining representations rather than creating new ones, with vectors emerging after 0.22% of tokens in OLMo-3 and Apertus

Assistant-like personas form early in pretraining and persist across checkpoints.

4563286.2K

——0——

Original post

#353@DHADFIELDMENELLOP

Julian Minder@JKMINDER

Viktor looked at how the persona vectors evolve across pretraining and post-training. One can find the vectors already very early in pretraining. A finding that motivates our recent Synthetic Persona Pretraining blogpost very well: those representations are shaped early.

9:21 AM · May 22, 2026