2d ago

AI Alignment Debate Highlights Data Weighting Effects Over Filtering

2200109

——0——

Original post

@ohabryka @sebkrier "Clearly" we should note that @sebkrier didn't actually say what his plan is in this post. Don't put words in his mouth

11:52 AM · May 14, 2026

Cluster Engagement

Engagement snapshots are unavailable for this cluster.no post metric buckets

#1685Alex Turner@TURN_TROUT

@ohabryka @sebkrier "Clearly" we should note that @sebkrier didn't actually say what his plan is in this post. Don't put words in his mouth

Oliver Habryka@ohabryka

No, this is literally not what the result says! Data filtering does not have a big effect! Upweighing positive stories has a big effect. Also, really, your plan for controlling superintelligence is so hyperstition a meme that "alignment is easy"? Clearly you can't be serious about this.

8:24 PM · May 8, 2026 · 2.3K Views

6:52 PM · May 14, 2026 · 31 Views

#1685Alex Turner@TURN_TROUT

@robertwiblin You can also see the defensiveness in @slatestarcodex's https://blog.aifutures.org/p/against-misalignment-as-self-fulfilling. It doesn't make sense.

Arguments weak, claims too strong. I told them not to post in that form, gave counterevidence, and then http://alignmentpretraining.ai indeed debunked important claims in the post

Alex Turner@Turn_Trout

@robertwiblin Yeah but it's in fact true that self-fulfilling misalignment is a negative impact of LW discourse. That doesn't mean we should ban the discourse or that it was overall bad to discuss. But the negative externality is real

6:50 PM · May 14, 2026 · 91 Views

7:09 PM · May 14, 2026 · 78 Views