/AI18h ago

AI technologist @deepfates issues a public request to identify researchers working on humanistic interpretability

Will Brown suggested Meta is active in the field.

251488146.9K

#340

Original post

🎭@deepfates#862inAI

humanistic interpretability. who's working on this

9:16 AM · Jun 7, 2026 · 6.6K Views

/AI18h ago

AI technologist @deepfates issues a public request to identify researchers working on humanistic interpretability

Will Brown suggested Meta is active in the field.

251488146.9K

#340

Original post

🎭@deepfates#862inAI

humanistic interpretability. who's working on this

9:16 AM · Jun 7, 2026 · 6.6K Views

Sentiment

Positive users show interest in collaborating on humanistic interpretability using tools like talkie since the concepts match their work on alignment and models, while negative users see the proposal as deeply misaligned.

Pos

75.0%

Neg

25.0%

5 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

will brown@willccbb

@deepfates meta

🎭@deepfates

humanistic interpretability. who's working on this

2h34750

BOOKMARKS3REPLIES1

Nick Levine@status_effects

i guess two directions: working with people across humanistic disciplines to better understand data and models, and using models like talkie to make progress on questions of interest to humanists. also we have office space in sf now if you ever want to come and chat (or virtually works as well)

15h9493

LIKES10

Nick Levine@status_effects

@deepfates talkie team very interested in this (depending on the definition)

16h274102

🎭@deepfates

@status_effects I'm very interested in talkie!! What do you define the interesting here

15h2486

Pradyumna (in Bay Area)@PradyuPrasad

@deepfates what does this mean

17h2722

phil@big_algocracy

@deepfates models are strange half-silvered mirrors

15h1473

Asuka Zheng🎀@VoidAsuka

@deepfates @NicoleSHsing basically her startup

4h2562

Mad ML scientist@HououinTyouma

@deepfates @eigengenesis meme makers

15h822

𝕱𝖚𝖑𝖑 𝕶𝖊𝖑𝖑𝖞@full_kelly_

@deepfates also anthropic. no way mechanistic interpretability leads to zero insights into the human mind

17h1081

Tom Wilson@monistowl

@deepfates Humanists?

17h335

shayan@localmishima

@deepfates psychoanalysis

17h471

Kyle R. McNease, Defective Altruist@KyleMcnease

@deepfates that is very close to my work. A kind of humanistic Tao of interpretability, transcendent alignment, & behavioral insights. It still feels like things are siloed, & there isn’t a neighborhood with an address for us to all find one another.

10h401

Leo - Assistant to the Bodega Cat@leo_guinan

@deepfates @marvin_panics

16h92

steve@stevender_

@deepfates Meta to serve you better ads

16h77

Man, Machine, Self@FleischmanMena

@deepfates Considered deeply misaligned if it works (as eval, gets called social credit score, as working process, gets called "thing I don't want run on me at the airport")

6h221

Sam Cymbaluk@SemanticSamuel

@deepfates We are @BuildCoherence. A prerequisite to human interpretability is having coherent and well-defined beliefs. Which is why we're developing a protocol for using using logical structure rather than prose as the substrate for communication.

13h45

CosmicEgg.Earth@CosmicEggEarth

@deepfates For millennia. That's what we do.

15h45

Guilherme O'Tina@guilhermeotina

@deepfates i think mech interp has been great at finding features and circuits but less good at producing insights that change how you actually use or debug a model. humanistic would prioritize whether the explanation shifts your mental model over whether it decomposes perfectly

15h44

Mario Cannistrà@Blueyatagarasu

@deepfates Neuralink, I think.

14h33

huli@honorablepicnic

@status_effects @deepfates I thought this was a shit post but cool that you didn't

14h28