6h ago

Yoav Goldberg argues that BitFit (bias-only fine-tuning) works for the same reason steering vectors do, and that it offers a richer class of adaptations

Dimitris Papailiopoulos replies that BitFit partly inspired his group's work on tuning only normalization layers (https://arxiv.org/abs/2302.07937).


Original post

i can't believe I just realized this now, but the reason BitFit (bias-only fine-tuning) works is actually the same reason steering vectors work. or rather, BitFit offers a richer class of adaptations than steering vectors.

8:36 AM · May 26, 2026
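
One way to read the claim, sketched in plain Python under assumptions that are not in the post: for a linear layer y = Wx + b, updating only the bias to b + delta shifts the output by the same vector delta for every input, which is what adding a fixed steering vector at that layer's output does. The names `linear`, `delta`, and `shift_from_bias_update` and the random weights are hypothetical and only illustrate the algebra; this is not the author's own derivation.

```python
import random


def linear(W, b, x):
    """Compute W @ x + b for a list-of-lists W and list vectors b, x."""
    return [sum(w_ij * x_j for w_ij, x_j in zip(row, x)) + b_i
            for row, b_i in zip(W, b)]


random.seed(0)
d_in, d_out = 4, 3

# Frozen layer parameters (random stand-ins for pretrained weights).
W = [[random.gauss(0, 1) for _ in range(d_in)] for _ in range(d_out)]
b = [random.gauss(0, 1) for _ in range(d_out)]

# Hypothetical bias update, standing in for what bias-only tuning would learn.
delta = [random.gauss(0, 1) for _ in range(d_out)]


def shift_from_bias_update(x):
    """Output change caused by replacing b with b + delta, for input x."""
    tuned = linear(W, [b_i + d_i for b_i, d_i in zip(b, delta)], x)
    base = linear(W, b, x)
    return [t - s for t, s in zip(tuned, base)]


# The shift should equal delta no matter which input is fed in,
# i.e. the bias update acts like a fixed vector added to the output.
inputs = [[random.gauss(0, 1) for _ in range(d_in)] for _ in range(5)]
shifts = [shift_from_bias_update(x) for x in inputs]
max_err = max(abs(s_i - d_i) for s in shifts for s_i, d_i in zip(s, delta))
print(max_err)  # tiny floating-point residue
```

Bias-only tuning trains one such offset for every bias term in the network, whereas a steering vector is typically added at one or a few chosen locations, which is consistent with the post's remark that BitFit covers a richer class of adaptations.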

#92(((ل()(ل() 'yoav))))👾@YOAVGO

@DimitrisPapail i remember this one!

Dimitris Papailiopoulos@DimitrisPapail

@yoavgo I loved BitFit. Was part inspiration for this work of ours https://arxiv.org/abs/2302.07937

3:52 PM · May 26, 2026 · 1.4K Views

4:25 PM · May 26, 2026 · 254 Views
