Shauli Ravfogel of NYU challenges claims that LLMs can introspect and report when their internal states are tampered with
The analysis found insufficient evidence of LLM self-monitoring