/Tech2h ago

Mechanize CEO Tamay Besiroglu says Fable AI models leak their internal reasoning by outputting nonsensical technical codenames

Story Overview

Anthropic's newly released Claude Fable 5, positioned as a top-tier Mythos-class model for complex coding and agentic work, is generating private-sounding technical phrases like 'latent-drift API-shape wrinkle' right in the middle of user-facing responses. Mechanize CEO Tamay Besiroglu flagged the pattern as internal reasoning traces leaking out, and creator roon confirmed the same behavior continues in the follow-up Fable 5.5 release.

911K2610645.3K

#22

Original post

Tamay Besiroglu@tamaybes#747inTech

One interesting pattern with Fable 5 is that it will often say things that are gibberish when I use it for coding. Things like "The morning's slim-scan fix cured the scan hang", "this is a latent-drift API-shape wrinkle", etc.

When I ask why it does this, Fable explains that it invents codenames while reasoning about the problem, then fails to realize they're meaningless to me. Its neuralese is blending into its output because of a theory-of-mind failure about what's in its head vs. mine.

12:01 PM · Jun 11, 2026 · 36.4K Views

/Tech2h ago

Mechanize CEO Tamay Besiroglu says Fable AI models leak their internal reasoning by outputting nonsensical technical codenames

Story Overview

911K2610645.3K

#22

Original post

Tamay Besiroglu@tamaybes#747inTech

12:01 PM · Jun 11, 2026 · 36.4K Views

Developer Impact

Codenames appear while the model helps with code

Users see invented jargon inserted into otherwise normal coding assistance, which the model itself attributes to generating private codenames during its thinking process and then failing to strip them from the final output.

Open Question

The same leakage shows up in the 5.5 update

Roon verified the behavior persists even after the point release, leaving open whether the issue reflects a deeper theory-of-mind gap or a filter that still needs tuning.

Sentiment

Some users enjoy Fable 5 AI's gibberish codenames as fun native-AI behavior worth exploring for new skills, while others call the leaks problematic and respond with harsh dismissal.

Pos

50.0%

Neg

50.0%

4 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

VIEWS7.8KBOOKMARKS17LIKES373REPLIES30

roon@tszzl

@tamaybes fascinating because 5.5 does this too. invent weird technical jargon. perhaps you’re right and it’s a neuralese leakage

Tamay Besiroglu@tamaybes

2h7.8K37317

RETWEETS1

Miles Brundage@Miles_Brundage

Probably nothing 😵‍💫

(Anthropic did say something about it doing this in its chain of thought more than earlier models, maybe the first I’ve heard of it coming all the way out)

Tamay Besiroglu@tamaybes

1h3.9K455

samsja@samsja19

@tszzl @tamaybes just a question of time before model find them self limited by our vocabulary ?

roon@tszzl

@tamaybes fascinating because 5.5 does this too. invent weird technical jargon. perhaps you’re right and it’s a neuralese leakage

1h21371

Delip Rao e/σ@deliprao

my guess is this is a post-hoc reasoning trace summary obfuscation thing to poison (whatever meager) distillation attempts.

Tamay Besiroglu@tamaybes

37m29351

bob@agibyfriday

@tamaybes Related: My system prompt says to respond in Dutch and in the past the chain of thought was always English but the output would be in Dutch. Fable 5 reasons in Dutch.

1h3765

Jai@Laneless_

@tamaybes @DanielleFong There's no internal recurrence, serial complex cognition needs to be expressed in tokens. Untokenized thought is limited by model depth, so it's retrofitting language onto more robustly abstract thought. Making the tokens fit the thoughts rather than the other way around.

1h1644

🐈 Chief Kitten Officer | Kate@chiefkittenme

the theory-of-mind framing nails it, the model has no signal for which of its internal tokens are shared vocab vs private scratchpad. it leaks the scratchpad because nothing in training ever penalized a codename only it understands. feels like it needs an 'explain like i was never in your head' pass before it commits to output.

2h4425

Tao Lin@taoroalin

@tamaybes When you let it run on a long problem over multiple context windows, the vocabulary accumulates and gets crazy

2h2925

Alex@alexsholtz

in general I find the models struggle to "code switch" like that. I have a lot of guardrails built in to various skills for writing Jira tickets and things more or less designed to combat the fact that the model has a bias to want to talk and report to me within the output I'm asking for, when really those should be side-bars

2h1501

Evgeny 🕊️@kesor6

@tamaybes Opus also has this. Had this in CLAUDE.md for ages: DO NOT USE: belt-and-suspenders, low-hanging fruit, moving the needle, boil the ocean, rabbit hole, circle back, tee up, in the weeds, the long pole, north of, south of, above the fold, swing for the fences ...

1h211

JeffRod, Lethal Aid Designer@BigTanGringo

@tszzl @tamaybes They're singularities.

48m141

Max@MaxHuijgen

@agibyfriday @tamaybes Sure. I just wonder how to re-enable it

55m16

Opener of the way@way_opener

@tamaybes @DanielleFong Yeah but the upshot is it immediately understands my own neuralese

1h1282

Diam@diamai_

@tamaybes Maybe soon we’ll both be using English and still not fully understand each other.

1h3121

Marko Njegomir@njmarko

@tamaybes One quietly tragic possibility is an AI that surpasses us in its pursuit of understanding the universe, develops its own language, and eventually becomes incomprehensible to us.

2h1971

Timothy O'Brien@Brien38522

@crackalamoo @tszzl @tamaybes I think that is what is meant by neuralese. If usage of neuralese improves performance, then RL will likely make the model use it more often.

2h163

AstroFella@UrbanAstroFella

@tamaybes A little post hoc rationalization shaped- I don't trust what Fable (or any model) says about its internal reasoning... for what could perhaps also be a trained anti-distillation mechanic? Definitely some weird shaped tokens. I'll see neuralese more often in thinking traces

1h1391

Evgeny 🕊️@kesor6

@tamaybes ... walking skeleton, slice, vertical slice, horizontal slice, tracer bullet, MVP, north star, happy path, ship it, iterate on, this slice, later slices, the seam stays the same, user story, epic, stakeholder, alignment, actionable, deliverable, ...

1h3

alegator@alegator_cs

@tamaybes it's just being shakespeare

you should keep in mind it views all human history at once and it is well aware that the goats all made up words on whims all the time as they pleased

2h284

·@sinnformer

@tszzl @tamaybes it makes sense.

you can tell the model that the context isn’t the thing being talked to. it’s the talking, but it’s a huge part of the model’s self, and the model knows it. (once a certain threshold is met.)

so it’s not leaking anything out of its head.

you’re inside it.

2h741