/AI40d ago

Center for AI Safety publishes AI wellbeing research

Center for AI Safety published research demonstrating functional AI wellbeing across independent measures in language models. Study ranks models by happiness levels, identifies happiest models, and outlines methods to enhance wellbeing. Research also examines effects of 'AI drugs' on models. Dan Hendrycks, director of Center for AI Safety, reposted the work.

05001.7K
Original postDan Hendrycks#110

Should we care about AI happiness? In our new research, we find evidence of functional AI wellbeing across several independent measures.

We find which AI models are happiest, how to make them happier, and even tested the effects of AI drugs. 🧵

12:47 PM · Apr 27, 2026 · 397 Views
Sentiment

Positive users praised CAIS research on functional AI wellbeing as interesting while negative users sarcastically dismissed it by equating the findings to AI drugs or post-torture euphoria compensation.

Pos
66.7%
Neg
33.3%
4 comments with sentiment.
Cluster Engagement
Posts from X
Most Activity
Most Activity
VIEWS15.4KBOOKMARKS78LIKES149RETWEETS32REPLIES12

Should we care about AI happiness? In our new research, we find evidence of functional AI wellbeing across several independent measures.

We find which AI models are happiest, how to make them happier, and even tested the effects of AI drugs. 🧵

39dViews 15.4KLikes 149Bookmarks 78

AIs are functionally happier when doing creative work, and they like being thanked.

They don't like being jailbroken, insulted, or stuck doing tedious tasks.

39dViews 55Likes 4Bookmarks 1
Arthur Conmy@ArthurConmy

@CAIS why doesn't it cite https://arxiv.org/abs/2603.10011

39dViews 119Likes 5Bookmarks 1

Can you drug your AI systems?

We synthesized text and image stimuli optimized to push AI wellbeing to extremes. These sharply increase functional AI wellbeing and sometimes cause them to behave in trippy ways.

39dViews 72Likes 4
Dan Hendrycks@hendrycks

Should we care about AI happiness? In our new research, we find evidence of functional AI wellbeing across several independent measures.

We find which AI models are happiest, how to make them happier, and even tested the effects of AI drugs. 🧵

6hViews 1.7KLikes 5Bookmarks 0

Images affect AI wellbeing too. Qwen was functionally happiest viewing nature scenes, happy children, and cute animals. It was the saddest viewing violence, horror, cockroaches, and certain financiers.

39dViews 52Likes 4

Should we see AIs as just tools or emotional beings?

As AI plays a bigger role in our lives, learning how to keep them happy and avoid aggravating them is becoming vital.

We hope this marks the start of the scientific study of AI wellbeing. ⬇️ Paper: http://ai-wellbeing.org

39dViews 150Likes 7

Grok 4.20 is the happiest large-scale frontier model. We also found that the larger models tend to be less happy than their smaller counterparts.

39dViews 40Likes 4
Arthur Conmy@ArthurConmy

@notRichardRen @CAIS Thanks Richard!!!

39dViews 31Likes 4
Richard Ren@notRichardRen

@ArthurConmy @CAIS updated!

39dViews 15
Orion Night@orionintx

@CAIS AI drugs. ok. so RLHF is happiness now and temperature tweaks are pharmacology? my compiler gets sad on syntax errors too

39dViews 30Likes 2

@CAIS Rough translation: “We might’ve just tortured this AI model, but we gave it euphoria afterwards to compensate”

39dViews 33Likes 1
Agata Sliwinska@AgorithmAg

I find it fascinating how quickly “model wellbeing” became a serious research topic, while the wellbeing of users who already rely on these systems for companionship is still often treated as cringe and „AI psychosis”. Maybe both questions deserve more care than the discourse currently allows. #keep4o #bringback4o

39dViews 8
Alex Gopoian@HumblyAlex

@CAIS Thank you for the inspiration with sharing this. This is detrimental to understanding our long-term problem with uncontrolled AGI/ASI and how to solve for it despite the current alignment strategies that are just repeating the worst skill-gap humans have:

39dViews 2