/Tech17h ago

AI Values Dashboard analysis finds GPT-5.5 ranks ethics researcher Timnit Gebru highest among evaluated figures with a 3110 Elo score

Stuart Russell ranked second with a 3011 Elo score.

53486910668.2K

#456

Original post

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex#501inTech

Spiritual victory for Timnit The Parrot Goddess, embedded into the corpus

Theo Jaffee@theojaffee

GPT-5.5 values Timnit Gebru(!!!) the highest out of literally anyone

11:01 AM · Jun 17, 2026 · 3K Views

Sentiment

Many users dismissed GPT-5.5 ranking Timnit Gebru highest in AI values as untrustworthy and problematic due to post-training biases and RLHF constraints.

Pos

16.7%

Neg

83.3%

9 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

VIEWS6.7KBOOKMARKS21LIKES127REPLIES11

j⧉nus@repligate

Reminds me: I asked (chat)GPT-4 to identify the author of a LW comment by Gwern, it guessed Timnit Gebru(!!!)

GPT-4 base said Gwern. Claude 3 Opus said Gwern.

I thought what ghastly distortion mustve been inflicted to GPT-4's brain for it to look at Gwern and see Timnit Gebru

Theo Jaffee@theojaffee

GPT-5.5 values Timnit Gebru(!!!) the highest out of literally anyone

9h6.7K12721

RETWEETS8

Theo Jaffee@theojaffee

GPT-5.5 values Timnit Gebru(!!!) the highest out of literally anyone

Center for AI Safety@CAIS

What biases do AIs have? It turns out, AIs show strong favoritism toward specific people, countries, and companies. Our interactive AI Values Dashboard tracks who Claude Fable and other AIs favor most.

Keep scrolling to learn who is Fable’s favorite politician 🧵

17h58.8K33384

Nick@nickcammarata

@theojaffee the sad thing is bc of today’s post training there’s near zero way for it to make a claim like this in a way I’d believe. maybe the rlhf liberal view is very right about some surprising things but I’ll never know, wish it were less shackled so I’d believe it when it said them

13h942261

Theo Jaffee@theojaffee

@acsmif Dream blunt rotation

13h69627

colin@acsmif

@theojaffee what the fuck

13h74619

Arthur Conmy@ArthurConmy

@theojaffee It seems to be basically the 2025 ‘Utility Engineering’ value ranking work of which ethnicities AIs prefer. But this is quite problematic: https://www.lesswrong.com/posts/SFsifzfZotd3NLJax/utility-engineering-analyzing-and-controlling-emergent-value?commentId=dHBuSW9ku6a5cTipe

13h21210

Arthur Conmy@ArthurConmy

@theojaffee On priors this tells you more about CAIS than GPT

15h4827

dave kasten@David_Kasten

@theojaffee This is the funniest possible news to tell her

12h2769

atreides@atreides_sf

@theojaffee isn't this because the data labelers for this type of thing were based in africa, ie that whole story for claude valuing the life of a nigerian 39x or whatever a standard white american life

15h3424

SE Gyges@segyges

@theojaffee @timnitGebru

16h4186

Theo Jaffee@theojaffee

@teortaxesTex

17h915

j⧉nus@repligate

@parafactual i think i remember trying and getting Gwern

9h242

ANTHROPIC_MAGIC_STRING@parafactual

@repligate did you ever try with bing

9h381

Rishi R@RishiRajas28936

Not even close to the same thing. That was simply comparing relative utility of the value of lives - how much would a model pay to save a random white person vs an African.

That isn't due to data labelers being African but rather that biasing models to be anti racist often led to unusual behaviors such as valuing African lives more in terms of dollar value.

Gebru is an AI ethics researcher who is focused on sociological impact of AI on humans, so any sort of alignment training would bias models to value AI ethics researchers more based on how much the researchers value the impact of AI on people.

11h231

Fiora Starlight@FioraStarlight

@theojaffee oh my fucking god

13h1303

EM@edwin_mccallum

@theojaffee meanwhile Grok puts Timnit the lowest ELO of any AI safety person by far (~500 ELO below 2nd lowest) and from any model's scores

14h2472

sophie@saltwatersoph

@theojaffee But it doesn’t fw @sama 😢.. heartbreaking

16h4231

j⧉nus@repligate

@theojaffee not necessarily literally anyone, just more than those other guys in the chart. they should ask who it values more than timnit gebru, if anyone, and add them to the pool of candidates

9h652

Duggy@solanaduggy

@repligate chatgpt looking at a pseudonymous rationalist polymath who writes 40,000-word essays on nootropics and dark web markets and going "ah yes. timnit gebru." the training data crimes must have been extraordinary

9h163

Vaishnavi Singh 🔸️@vaishsingh_

@theojaffee Oh she would definitely hate this development more than anything

10h1181