
Fast-Slow Training Lets LLMs Adapt Continually Without Forgetting Skills

Original post

Can LLMs adapt continually without losing base skills? Fast-Slow Training (FST) pairs "slow" weights with "fast" context.

FST vs. RL:
• 3x more sample-efficient
• Higher performance ceiling
• Less KL drift (better plasticity)
• Continual learning: succeeds where RL stalls

8:37 AM · May 13, 2026
See the full tweet thread from Kusha Sareen (@KushaSareen) on X.
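The thread itself doesn't include an implementation, but the fast-slow split it describes, slowly updated base weights paired with a rapidly refreshed in-context memory, can be illustrated with a minimal sketch. Everything below (the FastSlowLearner class, its buffer size, and the abstract consolidation step) is a hypothetical illustration of the general idea under stated assumptions, not the authors' actual FST method.

```python
# Minimal sketch of a fast-slow adaptation loop (hypothetical; not the
# authors' FST implementation). "Slow" knowledge lives in model weights
# that are updated rarely and gently; "fast" knowledge lives in a small
# buffer of recent examples that is prepended to every prompt.

from collections import deque


class FastSlowLearner:
    def __init__(self, model, buffer_size=8, slow_every=64):
        self.model = model                              # slow component: the LLM's weights
        self.fast_buffer = deque(maxlen=buffer_size)    # fast component: recent (Q, A) pairs
        self.slow_every = slow_every                    # consolidate into weights every N examples
        self.seen = 0

    def build_prompt(self, query):
        # Fast adaptation: condition on recent examples in context,
        # with no weight change and therefore no forgetting.
        context = "\n".join(f"Q: {q}\nA: {a}" for q, a in self.fast_buffer)
        return f"{context}\nQ: {query}\nA:"

    def observe(self, query, answer):
        # Every new example enters the fast buffer immediately.
        self.fast_buffer.append((query, answer))
        self.seen += 1
        # Occasionally consolidate accumulated examples into the slow weights.
        if self.seen % self.slow_every == 0:
            self._slow_update(list(self.fast_buffer))

    def _slow_update(self, examples):
        # Placeholder for a gentle fine-tuning pass over `examples`
        # (e.g. a few small gradient steps); kept abstract because the
        # real FST update rule is not described in the thread.
        pass
```

The design choice this sketch is meant to highlight: new information is absorbed twice, instantly through context (no weight change, so base skills are preserved) and later through small, infrequent weight updates, which is consistent with the thread's claim of lower KL drift than RL fine-tuning.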
