BIO
interpretability & agi safety @ google deepmind | cambridge mmath
David Duvenaud
@DavidDuvenaud
Machine learning prof @UofT. Former team lead at Anthropic. Working on generative models, inference, & latent structure.
Roger Grosse
@RogerGrosse
Ethan Perez
@EthanJPerez
Alignment team lead at Anthropic
Stephanie Chan
@scychan_brains
Staff Research Scientist at DeepMind. Artificial & biological brains 🤖 🧠 Societal impacts of AI + Science of AI. Views are my own.
Dylan HadfieldMenell
@dhadfieldmenell
Associate Prof @MITEECS working on value (mis)alignment in AI systems; Safety & Alignment Advisor at http://Character.AI; @dhadfieldmenell@bsky.social; he/him
Owain Evans
@OwainEvans_UK
Runs an AI Safety research group in Berkeley (Truthful AI) + Affiliate at UC Berkeley. Past: Oxford Uni, TruthfulQA, Reversal Curse. Prefer email to DM.
James Campbell
@jam3scampbell
post training @OpenAI
Rylan Schaeffer
@RylanSchaeffer
AI RS @ Meta TBD. On-Leave from Stanford w/ @sanmikoyejo. Prev @ Gemini, Meta, MIT, Harvard, Uber, UCL, UC Davis
Victoria Krakovna
@vkrakovna
Research scientist in AI alignment at Google DeepMind. Co-founder of Future of Life Institute @FLI_org. Views are my own and do not represent GDM or FLI.
Zhijing Jin
@ZhijingJin
Prof @UofTCompSci. Director @JinesisLab. Founder @EuroSafeAI. Scientist@MPI_IS w/ @bschoelkopf. @CausalNLP, NLP4SocialGood @NLP4SG. Mentor&mentee @ACLMentorship
Andi Peng
@TheAndiPenguin
&i@humans& (co-founder) // prev. building claude @AnthropicAI
Aryaman Arora
@aryaman2020
member of technical staff @stanfordnlp
Trenton Bricken
@TrentonBricken
Trying to make AI go well @AnthropicAI
Hailey Schoelkopf
@haileysch__
hillclimbing towards generality @anthropicai | prev @AiEleuther | views my own
Peter Hase
@peterbhase
AI Institute Fellow at Schmidt Sciences. Postdoc at Stanford NLP Group. Previously: Anthropic, AI2, Google, Meta, UNC Chapel Hill
Laura Ruis
@LauraRuis
Postdoc with @jacobandreas @MIT_CSAIL. PhD from @ucl_dark with @_rockt and @egrefen. Anon feedback: https://www.admonymous.co/laura-ruis
Micah Carroll
@MicahCarroll
Safety research @openai. Prev @berkeley_ai /w @ancadianadragan & Stuart Russell. CoT oversight / AI manipulation.
david rein
@idavidrein
science @METR_Evals. Formerly: early employee @cohere, made GPQA @nyuniversity
tom white
@dribnet
creations with code and networks
Cem Anil
@cem__anil
Machine learning / AI Safety at @AnthropicAI and University of Toronto / Vector Institute. Prev. @google (Blueshift Team) and @nvidia.
Samuel Albanie 🇬🇧
@SamuelAlbanie
frontier evals lead for gemini @GoogleDeepMind
Marius Hobbhahn
@MariusHobbhahn
CEO at Apollo Research @apolloaievals prev. ML PhD with Philipp Hennig & AI forecasting @EpochAIResearch
Daniel Johnson
@_ddjohnson
Member of Technical Staff at @TransluceAI. Building tools to study neural nets and their behaviors. He/him.
Arthur Conmy
@ArthurConmy
prev: fixing things @GoogleDeepMind
Elizabeth Barnes
@BethMayBarnes
Nitarshan
@nitarshan
computer @anthropic, PhD @cambridge_cl. prev created @aisecurityinst, AI Safety Summit, UK AI Research Resource, EU AI Code of Practice.
Ziming Liu
@ZimingLiu11
Assistant Professor @ Tsinghua CollegeAI (incoming), Postdoc @ Stanford, PhD @ MIT, BS @ PKU. Physics of AI, interpretability, Structuralism, KAN
Jeffrey Ladish
@JeffLadish
Applying the security mindset to everything @PalisadeAI
Sonia Joseph
@soniajoseph_
world models @AIatMeta
Michaël Trazzi
@MichaelTrazzi
http://michaeltrazzi.com
Tomek Korbak
@tomekkorbak
ai safety @openai | previously: @AISecurityInst @AnthropicAI @nyuniversity @SussexUni
Zac Kenton
@ZacKenton1
Research Scientist in AI safety at DeepMind. Views are my own and don't represent DeepMind.
Seán Ó hÉigeartaigh
@S_OhEigeartaigh
Director of http://ai-far.org at Uni of Cambridge | Researching Big Risks, and impacts of AI & emerging tech. Opinions own
Yiping Lu
@2prime_PKU
Kernel, ML for PDE, Robust learning,non-parametric stats/🌈/PKU👉Stanford👉NYU Courant👉Prof.@Northwestern IEMS/ Previous Intern @RIKEN_AIP
Toby Shevlane
@tshevl
@_Mantic_AI cofounder & CEO, on a mission to solve forecasting. Prev: research scientist @GoogleDeepMind, PhD at @UniofOxford.
akbir.
@akbirkhan
🐜
Neil Chowdhury
@ChowdhuryNeil
@TransluceAI, previously @OpenAI
Eric J. Michaud
@ericjmichaud_
Trying to make deep neural networks among the best understood objects in the universe. 💻🤖🧠👽🔭🚀
Dr. Roman Yampolskiy
@romanyam
Professor of Computer Science. AI Safety & Security Researcher. For Talks: giacomo@krugercowne.com, Interviews: roman.yampolskiy@louisville.edu
Max Zeff
@ZeffMax
Senior Writer covering AI @WIRED, author of the Model Behavior newsletter | Formerly @TechCrunch, @Gizmodo, @markets | DM me off the record on Signal @ mzeff.88
Oliver Habryka
@ohabryka
Building https://LessWrong.com and https://Lighthaven.space.
Brian Huang
@brianryhuang
@GoogleDeepmind @antigravity | prev math and cs @mit
Jaime Sevilla
@Jsevillamol
Director of @EpochAIResearch. Trying to glimpse the future of AI.
Miles Wang
@MilesKWang
Researcher @OpenAI
Daniel Paleka
@dpaleka
ai safety researcher | phd @CSatETH | https://danielpaleka.com
Joel Becker
@joel_bkr
trying to figure out what on earth is going on with AI capabilities @METR_evals. 'soccer'-me @MessiSeconds.
alex lawsen
@lxrjl
AI Grantmaking @ Coefficient Giving Previously advising @ 80,000 Hours, teaching, forecasting, poker. Views my 🐒's
Saurabh Shah
@saurabh_shah2
human-ing & AI-ing @humansand prev @allen_ai @Apple @Penn 🎤dabbler of things🎸 🐈⬛enjoyer of cats 🐈 and mountains🏔️he/him
Lee Sharkey
@leedsharkey
Scruting matrices @ Goodfire | Previously: cofounded Apollo Research
Ekdeep Singh Lubana
@EkdeepL
Member of Technical Staff @GoodfireAI; Previously: Postdoc / PhD at Center for Brain Science, Harvard and University of Michigan