/AI9h ago

Jack Morris argues LLM humor benchmarks have plateaued since Google's PaLM in 2022 despite scaling

Fable AI's repetitive jokes about NCCL timeouts prompted the assessment

1210811013.3K
Original post
Jack Morris@jxmnop#203inAI

the new Fable still can't tell a joke

i think jokery evals plateaued with Google's PaLM models in 2022, no one has pushed SOTA since then

maybe another 10 trillion parameters will do the trick!

10:59 AM 路 Jun 9, 2026 路 11.7K Views
Sentiment
Sentiment building, check back later.
Cluster Engagement
Posts from X
Most Activity
Most Activity
VIEWS1.8KBOOKMARKS2LIKES11

if i had a NCCL for every bad LLM joke, i could train a funny LLM

the new Fable still can't tell a joke

i think jokery evals plateaued with Google's PaLM models in 2022, no one has pushed SOTA since then

maybe another 10 trillion parameters will do the trick!

2hViews 1.8KLikes 11Bookmarks 2