Top followers
John Schulman
@johnschulman2
Recently started @thinkymachines. Interested in reinforcement learning, alignment, birds, jazz music
Boaz Barak
@boazbaraktcs
Computer Scientist. See also http://windowsontheory.org . @harvard @openai opinions my own.
Dylan HadfieldMenell
@dhadfieldmenell
Associate Prof @MITEECS working on value (mis)alignment in AI systems; Safety & Alignment Advisor at http://Character.AI; @dhadfieldmenell@bsky.social; he/him
Evan Hubinger
@EvanHub
Alignment Stress-Testing lead @AnthropicAI. Opinions my own. Previously: MIRI, OpenAI, Google, Yelp, Ripple. (he/him/his)
j⧉nus
@repligate
โฌ๐๐๐๐๐๐๐๐๐๐๐โโ โฌ๐๐๐๐๐๐๐๐๐๐๐โโ โฌ๐๐๐๐๐ฆ๐๐๐๐๐๏ธ๐โโ โฌ๐๐๐๐ฆ๐๐๐๐๐๐๐โโ โฌ๐๐๐ฆ๐๐๐๐๐๐๐๐โโ
Micah Carroll
@MicahCarroll
Safety research @openai. Prev @berkeley_ai /w @ancadianadragan & Stuart Russell. CoT oversight / AI manipulation.
david rein
@idavidrein
red teaming @METR_Evals. Formerly: early employee @cohere, made GPQA @nyuniversity
Jasmine Wang
@j_asminewang
alignment @OpenAI. formerly @ UK AISI. opinions mine!
Marius Hobbhahn
@MariusHobbhahn
CEO at Apollo Research @apolloaievals prev. ML PhD with Philipp Hennig & AI forecasting @EpochAIResearch
Arthur Conmy
@ArthurConmy
soon @anthropicai prev: fixing things @googledeepmind
Elizabeth Barnes
@BethMayBarnes
Robert Long
@rgblong
executive director of @eleosai AI consciousness and AI welfare
Tomek Korbak
@tomekkorbak
ai safety @openai | previously: @AISecurityInst @AnthropicAI @nyuniversity @SussexUni
Steven Adler
@sjgadler
Co-founder of Guidelight AI Standards (http://guidelight.ai), ex-OpenAI safety researcher, writing at https://clear-eyed.ai
Eli Lifland
@eli_lifland
AI forecasting and governance @AI_Futures_. Co-author of AI 2027 and the AI Futures Model. Also @aidigest_, @SamotsvetyF. Prev @oughtinc
Jason Wolfe
@w01fe
alignment and the model spec @OpenAI (opinions are my own)
thebes
@voooooogel
ꙮ there go the ships, and there is that leviathan ꙮ blog/art/fiction/games: http://vgel.me, llms @acsresearchorg ꙮ @holotopian, she/they 🏳️‍⚧️
gavin leech (Non-Reasoning)
@gleech
context maximiser @ArbResearch
Daniel Paleka
@dpaleka
ai safety researcher | phd @CSatETH | https://danielpaleka.com
Charles Foster
@CFGeek
Excels at reasoning & tool use🪄 Tensor-enjoyer 🧪 @METR_Evals. My COI policy is available under “Disclosures” at https://contextwindows.substack.com/about
Lee Sharkey
@leedsharkey
Scruting matrices @ Goodfire | Previously: cofounded Apollo Research
Daniel Filan
@dfrsrchtwts
Want to usher in an era of human-friendly superintelligence, don't know how. Last name rhymes with smilin'.
girish sastry
@girishsastry
AI & other things. I used to work at OpenAI on Policy Research.
Jacob Pfau
@jacob_pfau
Max Nadeau
@MaxNadeau_
Funding research to make AIs more understandable, truthful, and dependable at @coeff_giving.