/AI18h ago

AI technologist @deepfates issues a public request to identify researchers working on humanistic interpretability

Will Brown suggested Meta is active in the field.

251488146.9K
Original post
🎭@deepfates#862inAI

humanistic interpretability. who's working on this

9:16 AM · Jun 7, 2026 · 6.6K Views
Sentiment

Positive users show interest in collaborating on humanistic interpretability using tools like talkie since the concepts match their work on alignment and models, while negative users see the proposal as deeply misaligned.

Pos
75.0%
Neg
25.0%
5 comments with sentiment.
Cluster Engagement
Posts from X
Most Activity
Most Activity
VIEWS347
will brown@willccbb

@deepfates meta

🎭@deepfates

humanistic interpretability. who's working on this

2hViews 347Likes 5Bookmarks 0
BOOKMARKS3REPLIES1
Nick Levine@status_effects

i guess two directions: working with people across humanistic disciplines to better understand data and models, and using models like talkie to make progress on questions of interest to humanists. also we have office space in sf now if you ever want to come and chat (or virtually works as well)

15hViews 94Likes 9Bookmarks 3
LIKES10
Nick Levine@status_effects

@deepfates talkie team very interested in this (depending on the definition)

16hViews 274Likes 10Bookmarks 2
🎭@deepfates

@status_effects I'm very interested in talkie!! What do you define the interesting here

15hViews 248Likes 6
phil@big_algocracy

@deepfates models are strange half-silvered mirrors

15hViews 147Likes 3
Asuka Zheng🎀@VoidAsuka

@deepfates @NicoleSHsing basically her startup

4hViews 256Likes 2
Mad ML scientist@HououinTyouma

@deepfates @eigengenesis meme makers

15hViews 82Likes 2
Tom Wilson@monistowl

@deepfates Humanists?

17hViews 335
shayan@localmishima

@deepfates psychoanalysis

17hViews 47Likes 1

@deepfates that is very close to my work. A kind of humanistic Tao of interpretability, transcendent alignment, & behavioral insights. It still feels like things are siloed, & there isn’t a neighborhood with an address for us to all find one another.

10hViews 40Likes 1
steve@stevender_

@deepfates Meta to serve you better ads

16hViews 77
Man, Machine, Self@FleischmanMena

@deepfates Considered deeply misaligned if it works (as eval, gets called social credit score, as working process, gets called "thing I don't want run on me at the airport")

6hViews 22Likes 1
Sam Cymbaluk@SemanticSamuel

@deepfates We are @BuildCoherence. A prerequisite to human interpretability is having coherent and well-defined beliefs. Which is why we're developing a protocol for using using logical structure rather than prose as the substrate for communication.

13hViews 45
CosmicEgg.Earth@CosmicEggEarth

@deepfates For millennia. That's what we do.

15hViews 45
Guilherme O'Tina@guilhermeotina

@deepfates i think mech interp has been great at finding features and circuits but less good at producing insights that change how you actually use or debug a model. humanistic would prioritize whether the explanation shifts your mental model over whether it decomposes perfectly

15hViews 44
huli@honorablepicnic

@status_effects @deepfates I thought this was a shit post but cool that you didn't

14hViews 28
Load more posts