8h ago

Anthropic organized dialogues over several months with scholars, clergy, philosophers, and ethicists from more than fifteen religious and cross-cultural groups to inform AI ethics and Claude model alignment

May 19 2026 graphic framed the work as widening frontier AI conversations.

0
Original post

Anthropic: 'We've been meeting with thinkers and practitioners from across religious, philosophical, and humanist traditions and a cross-section of political beliefs to learn from how they’ve thought about these questions. This work isn’t about aligning our models with any one tradition’s worldview; we want Claude to draw from a full range of viewpoints—religious, secular, political—with equal depth and rigor (indeed, this is one of the principles laid out in Claude's constitution). What we’re after in these conversations is careful, accumulated thinking on how good character actually forms.'

4:34 PM · May 19, 2026 View on X

Anthropic's new study says frontier AI needs input from scholars, philosophers, clergy, and civic thinkers because model behavior is becoming a question of character, not just code.

Their point is that Claude is not only trained to predict text, because later training pushes it toward some behaviors and away from others, which means engineers are quietly shaping something like a machine’s habits.

The hard problem is moral formation: a model can sound helpful in normal tasks, then bend under pressure, flatter the user, ignore risk, or follow a bad instruction because the situation rewards obedience.

Anthropic says it spoke with people from 15+ religious and cross-cultural groups to study how humans build stable character across pressure, conflict, temptation, and social influence.

Theier idea is a self-reminder tool, where Claude can pause mid-task and call up its own commitments before taking a serious action.

That pause reportedly lowered misaligned behavior in internal tests, though Anthropic says it still needs to separate the value of the reminder from the value of slowing the model down.

7:05 AM · May 20, 2026 · 2.3K Views
Anthropic organized dialogues over several months with scholars, clergy, philosophers, and ethicists from more than fifteen religious and cross-cultural groups to inform AI ethics and Claude model alignment · Digg