16h ago

e/acc founder Guillaume Verdon proposes 'RLHaHaF' to align large language models using human humor feedback

The concept plays on Reinforcement Learning from Human Feedback.

108279111531.4K

——0——

Original post

LLMs are consistently unfunny and uncool. Who’s solving this

QUOTE POST

We need RLHaHaF

bayes@bayeslord

LLMs are consistently unfunny and uncool. Who’s solving this

3:28 AM · May 29, 2026 · 24.1K Views

8:50 AM · May 29, 2026 · 7.7K Views

Cluster engagement