Laura Ruis and Jacob Andreas introduce Self-CTRL to align LLM self-descriptions with their actual prompt behavior · Digg