Shameless plug, but this nice work supports our ICLR paper, ICL Activation Alignment, pretty much spot on.
- Activations (internals) provide a much stronger learning signal than just tokens.
- This improves sample efficiency and avoids learning spurious correlations.
Links below: