Researchers from OpenAI and Anthropic publish papers defining quantitative metrics for faithfulness and monitorability in chain-of-thought reasoning by large language models
— Related OpenAI trace spans 125 pages with one insight labeled frightening.