16h ago

e/acc founder Guillaume Verdon proposes 'RLHaHaF' to align large language models using human humor feedback

The name is a play on Reinforcement Learning from Human Feedback (RLHF).

Original post

LLMs are consistently unfunny and uncool. Who’s solving this

8:28 PM · May 28, 2026

We need RLHaHaF

bayes (@bayeslord)

LLMs are consistently unfunny and uncool. Who’s solving this

3:28 AM · May 29, 2026 · 24.1K Views
8:50 AM · May 29, 2026 · 7.7K Views