New Paper Explains Why Larger Models Retain Rare Skills Better · Digg