e/acc founder Guillaume Verdon proposes 'RLHaHaF' to align large language models using human humor feedback
The concept plays on Reinforcement Learning from Human Feedback.
——0——
QUOTE POST
#839Beff (e/acc)@BEFFJEZOS
We need RLHaHaF
LLMs are consistently unfunny and uncool. Who’s solving this
3:28 AM · May 29, 2026 · 24.1K Views
8:50 AM · May 29, 2026 · 7.7K Views