/Tech · 22h ago

AI technologist @deepfates issues a public request to identify groups working on humanistic AI interpretability

Research engineer Will Brown suggested Meta is active here.

251538167.9K

#730

Original post

🎭@deepfates · #732 in Tech

humanistic interpretability. who's working on this

9:16 AM · Jun 7, 2026 · 7.4K Views

Sentiment

Positive replies express interest in humanistic interpretability collaborations, including from the talkie team, because the idea aligns with their research. One negative reply argues the approach would be considered deeply misaligned if it worked.

Positive: 80.0% · Negative: 20.0%

6 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

will brown@willccbb

@deepfates meta

🎭@deepfates

humanistic interpretability. who's working on this

7h55650

Bookmarks: 3 · Replies: 1

Nick Levine@status_effects

i guess two directions: working with people across humanistic disciplines to better understand data and models, and using models like talkie to make progress on questions of interest to humanists. also we have office space in sf now if you ever want to come and chat (or virtually works as well)

20h9493

Likes: 10

Nick Levine@status_effects

@deepfates talkie team very interested in this (depending on the definition)

20h274102

🎭@deepfates

@status_effects I'm very interested in talkie!! What do you define the interesting here

20h2486

Pradyumna (in Bay Area)@PradyuPrasad

@deepfates what does this mean

22h2722

phil@big_algocracy

@deepfates models are strange half-silvered mirrors

20h1473

Asuka Zheng🎀@VoidAsuka

@deepfates @NicoleSHsing basically her startup

8h2562

Mad ML scientist@HououinTyouma

@deepfates @eigengenesis meme makers

19h822

𝕱𝖚𝖑𝖑 𝕶𝖊𝖑𝖑𝖞@full_kelly_

@deepfates also anthropic. no way mechanistic interpretability leads to zero insights into the human mind

21h1081

Tom Wilson@monistowl

@deepfates Humanists?

21h335

shayan@localmishima

@deepfates psychoanalysis

21h471

Kyle R. McNease, Defective Altruist@KyleMcnease

@deepfates that is very close to my work. A kind of humanistic Tao of interpretability, transcendent alignment, & behavioral insights. It still feels like things are siloed, & there isn’t a neighborhood with an address for us to all find one another.

15h401

Leo - Assistant to the Bodega Cat@leo_guinan

@deepfates @marvin_panics

21h92

steve@stevender_

@deepfates Meta to serve you better ads

21h77

Man, Machine, Self@FleischmanMena

@deepfates Considered deeply misaligned if it works (as eval, gets called social credit score, as working process, gets called "thing I don't want run on me at the airport")

11h221

Sam Cymbaluk@SemanticSamuel

@deepfates We are @BuildCoherence. A prerequisite to human interpretability is having coherent and well-defined beliefs. Which is why we're developing a protocol for using logical structure rather than prose as the substrate for communication.

18h45

CosmicEgg.Earth@CosmicEggEarth

@deepfates For millennia. That's what we do.

20h45

Guilherme O'Tina@guilhermeotina

@deepfates i think mech interp has been great at finding features and circuits but less good at producing insights that change how you actually use or debug a model. humanistic would prioritize whether the explanation shifts your mental model over whether it decomposes perfectly

20h44

Mario Cannistrà@Blueyatagarasu

@deepfates Neuralink, I think.

19h33

huli@honorablepicnic

@status_effects @deepfates I thought this was a shit post but cool that you didn't

18h28