1d agoShashwat Singh and NYU's Shauli Ravfogel challenge claims that LLMs can introspect and detect alterations to their internal statesThey found no evidence LLMs can report on altered reasoning.