/Tech3h ago

ALTER founder David Manheim and creator Rohit debate whether RLHF produces expert-level quality or merely averages layperson preferences

Rohit points to mediocre popular media to challenge the optimism

220034

#1315

Original post

rohit@krishnanrohit#1315inTech

@davidmanheim I look at the slopfest in popular media and wonder

David Manheim@davidmanheim

@krishnanrohit Yeah, but this goes a step beyond blending the pretraining plus average of the layperson-provided RLHF, to where experts like it. If it's interpolating to find an average, you'd expect it to do worse. (Or is literary beauty like human faces, where averages are the ideal?)

7:20 AM · Jun 21, 2026 · 17 Views

Sentiment

Sentiment building, check back later.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

VIEWS10LIKES1

David Manheim@davidmanheim

@krishnanrohit It seems a bit ironic, though on reflection not surprising, that media's hill-climbing for engagement produces much worse quality than averaging of human output for quality.

If nothing else, because journalists are optimally ignorant; https://davidmanheim.substack.com/p/its-time-for-some-game-theory-about-game-theory-about-game-theory-6791629fbe8c

3h101