Yoav Goldberg argues that BitFit bias-only fine-tuning succeeds due to the same underlying mechanisms as steering vectors
The methodology inspired research on tuning only normalization layers.
โโ0โโ
@DimitrisPapail i remember this one!
@yoavgo I loved BitFit. Was part inspiration for this work of ours https://arxiv.org/abs/2302.07937
3:52 PM ยท May 26, 2026 ยท 1.4K Views
4:25 PM ยท May 26, 2026 ยท 254 Views
@yoavgo I loved BitFit. Was part inspiration for this work of ours https://arxiv.org/abs/2302.07937
i cant believe I just realized this now, but the reason BitFit (bias only fine-tuning) works, is actually the same reason steering vectors work. or rather, bitfit offers a richer class of adaptations than steering vectors.
3:36 PM ยท May 26, 2026 ยท 3.9K Views
3:52 PM ยท May 26, 2026 ยท 1.4K Views