Our original subliminal learning paper demonstrated subliminal learning on MNIST with MLPs. We now show that this result holds across many ablations. In particular, it holds with full fine-tuning (not just LoRA) and with SGD (as well as other optimizers).
We also prove a theorem about subliminal learning, which applies to SGD, full-weight updates, and arbitrary neural networks. https://www.nature.com/articles/s41586-026-10319-8/figures/7
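
To give a feel for the kind of statement involved, here is a minimal sketch in plain Python. It assumes the theorem has the form given in the original paper, which is not restated here: if teacher and student share an initialization and the teacher is one small gradient step away from it, then a single gradient step of the student toward the teacher's outputs, on any inputs, has a non-negative inner product with the teacher's update. The tiny tanh MLP, the sine regression task, the noise inputs, and all names (`init_params`, `forward_and_grad`, `mse_grad`, `run_demo`) are illustrative assumptions and not the setup used in the paper.

```python
import math
import random


def init_params(n_in, n_hidden, rng):
    """Flat parameter list [W, b, v, c] for a one-hidden-layer tanh MLP (scalar output)."""
    n_params = n_hidden * n_in + n_hidden + n_hidden + 1
    return [rng.gauss(0.0, 0.5) for _ in range(n_params)]


def forward_and_grad(theta, x, n_hidden):
    """Return f(x; theta) and the analytic gradient of f with respect to theta."""
    n_in = len(x)
    w_end = n_hidden * n_in
    W = theta[:w_end]
    b = theta[w_end:w_end + n_hidden]
    v = theta[w_end + n_hidden:w_end + 2 * n_hidden]
    c = theta[-1]
    h = [
        math.tanh(sum(W[j * n_in + i] * x[i] for i in range(n_in)) + b[j])
        for j in range(n_hidden)
    ]
    out = sum(v[j] * h[j] for j in range(n_hidden)) + c
    # Gradients in the same order as the flat layout: W, b, v, c.
    dW = [v[j] * (1 - h[j] ** 2) * x[i] for j in range(n_hidden) for i in range(n_in)]
    db = [v[j] * (1 - h[j] ** 2) for j in range(n_hidden)]
    dv = list(h)
    return out, dW + db + dv + [1.0]


def mse_grad(theta, inputs, targets, n_hidden):
    """Gradient of the mean of 0.5 * (f(x; theta) - y)^2 over the dataset."""
    grad = [0.0] * len(theta)
    for x, y in zip(inputs, targets):
        out, g = forward_and_grad(theta, x, n_hidden)
        for k in range(len(theta)):
            grad[k] += (out - y) * g[k] / len(inputs)
    return grad


def run_demo(seed=0, n_in=3, n_hidden=4, teacher_lr=1e-3, student_lr=1e-3):
    """Return the inner product between the student's and the teacher's updates."""
    rng = random.Random(seed)
    theta0 = init_params(n_in, n_hidden, rng)  # shared initialization

    # Teacher: one small gradient step on a (hypothetical) regression task.
    task_x = [[rng.gauss(0.0, 1.0) for _ in range(n_in)] for _ in range(20)]
    task_y = [math.sin(sum(x)) for x in task_x]
    g_task = mse_grad(theta0, task_x, task_y, n_hidden)
    delta_teacher = [-teacher_lr * g for g in g_task]
    theta_teacher = [t + d for t, d in zip(theta0, delta_teacher)]

    # Student: one gradient step imitating the teacher on unrelated noise inputs.
    noise_x = [[rng.uniform(-1.0, 1.0) for _ in range(n_in)] for _ in range(20)]
    teacher_out = [forward_and_grad(theta_teacher, x, n_hidden)[0] for x in noise_x]
    g_imitate = mse_grad(theta0, noise_x, teacher_out, n_hidden)
    delta_student = [-student_lr * g for g in g_imitate]

    return sum(s * t for s, t in zip(delta_student, delta_teacher))


if __name__ == "__main__":
    print("student update . teacher update =", run_demo())
```

The student never sees the teacher's task data, only the teacher's outputs on noise, yet its update points (weakly) in the teacher's direction. This toy check uses plain gradient descent on all weights, matching the SGD and full-weight-update conditions mentioned above; it illustrates the claim on one small network and is not a proof.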




