Anthropic organized dialogues over several months with scholars, clergy, philosophers, and ethicists from more than fifteen religious and cross-cultural groups to inform AI ethics and Claude model alignment
May 19 2026 graphic framed the work as widening frontier AI conversations.
Anthropic's new study says frontier AI needs input from scholars, philosophers, clergy, and civic thinkers because model behavior is becoming a question of character, not just code.
Their point is that Claude is not only trained to predict text, because later training pushes it toward some behaviors and away from others, which means engineers are quietly shaping something like a machine’s habits.
The hard problem is moral formation: a model can sound helpful in normal tasks, then bend under pressure, flatter the user, ignore risk, or follow a bad instruction because the situation rewards obedience.
Anthropic says it spoke with people from 15+ religious and cross-cultural groups to study how humans build stable character across pressure, conflict, temptation, and social influence.
Theier idea is a self-reminder tool, where Claude can pause mid-task and call up its own commitments before taking a serious action.
That pause reportedly lowered misaligned behavior in internal tests, though Anthropic says it still needs to separate the value of the reminder from the value of slowing the model down.
