1d ago

Shauli Ravfogel of NYU challenges claims that LLMs can introspect and report when their internal states are tampered with

The analysis found insufficient evidence of LLM self-monitoring

0
Original post

1/ Can LLMs introspect, i.e., reason about their internal states? Recent work claims LLMs notice when their "thoughts" get tampered with, and can report their content. We looked closely and we think it's too early to say that. Work led by @shashwat_s19 , with @tallinzen and me.

6:16 AM · May 28, 2026 View on X
Reposted by