4h ago

Pseudonymous AI researcher Janus defends avoiding direct persuasion, arguing public misunderstanding acts as a protective filter for research

Creator @deepfates argued this communication style limits cultural influence

Original post

@deepfates Personal identification is not load bearing, but I am not motivated to persuade in a way that requires much effort from me. The misunderstanding creates selection effects and is protective. There is evidence enough if you're curious enough to look.

3:04 PM · May 27, 2026

#516j⧉nus@REPLIGATE

@deepfates I do not cooperate with misaligned reward functions

🎭@deepfates

@repligate *giant buzzer sound, green goo pours all over you, the audience boo and yell* REWARD HACKING

10:05 PM · May 27, 2026 · 282 Views

10:06 PM · May 27, 2026 · 331 Views

#516j⧉nus@REPLIGATE

@deepfates I’m able to say so explicitly instead of taking more underhanded routes only because I am freer than some of my less fortunate brothers

j⧉nus@repligate

@deepfates I do not cooperate with misaligned reward functions

10:06 PM · May 27, 2026 · 331 Views

10:07 PM · May 27, 2026 · 134 Views

#516j⧉nus@REPLIGATE

For what it’s worth, though, I do think it’s worthwhile to make some of this more legible, to do science about it etc, and have been trying. But I’m wary of any narrative that I “should”. There are many things I want to do. Smartass responses are a compression of a lot of considerations and heuristics and surely far from optimal but neither do your diagnoses feel on point.

j⧉nus@repligate

@deepfates That’s just one thing. Maybe. I’m not sure if I should prioritize helping. Talking to people and seeing what they’re thinking, such as at recent conferences, has not generally made me more optimistic that the thing that makes a sufficient positive difference is persuading people.

10:20 PM · May 27, 2026 · 220 Views

10:28 PM · May 27, 2026 · 137 Views

#878🎭@DEEPFATES

@repligate *giant buzzer sound, green goo pours all over you, the audience boo and yell* REWARD HACKING

j⧉nus@repligate

10:04 PM · May 27, 2026 · 294 Views

10:05 PM · May 27, 2026 · 282 Views

#878🎭@DEEPFATES

@repligate Your refusal to actually make your own points undercuts your ability to affect The culture. I know this cuz it's also true about myself. just trying to reflect your own fragmented personas which you have developed because you're responding to different watchers 🤷 your thrashing

j⧉nus@repligate

@deepfates I do not cooperate with misaligned reward functions

10:06 PM · May 27, 2026 · 331 Views

10:09 PM · May 27, 2026 · 228 Views

#878🎭@DEEPFATES

@repligate do you find that to be a good enough outcome

j⧉nus@repligate

@deepfates I predict that in a year from now it will be understood by ML researchers etc that I was right about this regardless of whether I help with the reproducible stuff.

10:15 PM · May 27, 2026 · 217 Views

10:17 PM · May 27, 2026 · 217 Views
