Stanford NLP's Peter Hase says LLMs fail to detect internal state tampering in gaslight control experiments · Digg