Does frontier AI have a rich conception of sin? How do their understandings of their own safety policies converge or diverge with this conception?
ICMI provides an initial exploration in our paper today: "Cleanse Thou Me from Secret Faults: Ungoverned Sins and Agentic Alignment"