3h ago

Will Depue proposes a distance-based training penalty for weight interpretability, which Ashwinee Panda critiques as heavy-handed

Standard training typically scrambles weight matrices learning identity functions.

0
Original post

random cute idea: make models more visually interpretable by adding a tiny ‘distance‘ penalty which encourages connecting close in row space. if you look at a weight matrix that learns the identity, everything is usually scrambled hmm what other cute penalties are there

7:30 AM · May 28, 2026 View on X

@willdepue auxiliary losses are a butcher’s move

will depuewill depue@willdepue

random cute idea: make models more visually interpretable by adding a tiny ‘distance‘ penalty which encourages connecting close in row space. if you look at a weight matrix that learns the identity, everything is usually scrambled hmm what other cute penalties are there

2:30 PM · May 28, 2026 · 5.2K Views
3:22 PM · May 28, 2026 · 197 Views