Periodic Labs' Rohan Pandey jokes that backpropagation is the ultimate interpretability researcher, easily identifying and steering neuron behaviors · Digg