
Jack Morris argues LLM humor benchmarks have plateaued since Google's PaLM in 2022 despite scaling

Fable AI's repetitive jokes about NCCL timeouts prompted the assessment

Original post
Jack Morris (@jxmnop), #215 in Tech

the new Fable still can't tell a joke

i think jokery evals plateaued with Google's PaLM models in 2022, no one has pushed SOTA since then

maybe another 10 trillion parameters will do the trick!

10:59 AM · Jun 9, 2026 · 15.2K Views
Posts from X
Views 3.9K · Bookmarks 2 · Likes 24 · Replies 1

if i had a NCCL for every bad LLM joke, i could train a funny LLM

21h · Views 3.9K · Likes 24 · Bookmarks 2