10h agoStudy Shows NLAs Fail to Interpret Steered Activations Faithfullyโโ0โโOriginal postDK#1002@DANIELKHASHABIOPโ-โ-modal|@NOAHCHREINThis is interesting I definitely think of prompting as steering in activation space but I took for granted that I could always come up with some, perhaps complex, prompt to steer activations however I wanted. Guess I was wrong!7:13 PM ยท May 18, 2026 View on XReposted byDK#1002|@DANIELKHASHABI