Tech · 2h ago

Mechanize CEO Tamay Besiroglu says Fable AI models leak their internal reasoning by outputting nonsensical technical codenames

Story Overview

Anthropic's newly released Claude Fable 5, positioned as a top-tier Mythos-class model for complex coding and agentic work, is inserting private-sounding technical phrases such as "latent-drift API-shape wrinkle" into user-facing responses. Mechanize CEO Tamay Besiroglu flagged the pattern as internal reasoning leaking into the output, and roon replied that "5.5 does this too."

Original post
Tamay Besiroglu @tamaybes · #747 in Tech

One interesting pattern with Fable 5 is that it will often say things that are gibberish when I use it for coding. Things like "The morning's slim-scan fix cured the scan hang", "this is a latent-drift API-shape wrinkle", etc.

When I ask why it does this, Fable explains that it invents codenames while reasoning about the problem, then fails to realize they're meaningless to me. Its neuralese is blending into its output because of a theory-of-mind failure about what's in its head vs. mine.

12:01 PM · Jun 11, 2026 · 36.4K Views
Developer Impact

Codenames appear while the model helps with code

Invented jargon appears inside otherwise normal coding assistance. When asked, the model attributes it to coining private codenames while reasoning about the problem and then failing to recognize that those names mean nothing to the user.

Open Question

The same leakage shows up in the 5.5 update

roon reported that 5.5 invents the same kind of technical jargon, which leaves open whether the issue reflects a deeper theory-of-mind gap or an output filter that still needs tuning.

Sentiment

Some users treat Fable 5's invented codenames as interesting native-AI behavior worth exploring, while others call the leaks a problem or dismiss them harshly.

Pos 50.0% · Neg 50.0% · 4 comments with sentiment
Cluster Engagement
Posts from X
Most Activity
Views 7.8K · Bookmarks 17 · Likes 373 · Replies 30
roon @tszzl

@tamaybes fascinating because 5.5 does this too. invent weird technical jargon. perhaps you’re right and it’s a neuralese leakage

2h · Views 7.8K · Likes 373 · Bookmarks 17
Retweets 1
Miles Brundage @Miles_Brundage

Probably nothing 😵‍💫

(Anthropic did say something about it doing this in its chain of thought more than earlier models, maybe the first I’ve heard of it coming all the way out)

1h · Views 3.9K · Likes 45 · Bookmarks 5
samsja @samsja19

@tszzl @tamaybes just a question of time before model find them self limited by our vocabulary ?

1h · Views 213 · Likes 7 · Bookmarks 1

my guess is this is a post-hoc reasoning trace summary obfuscation thing to poison (whatever meager) distillation attempts.

37m · Views 293 · Likes 5 · Bookmarks 1
bob @agibyfriday

@tamaybes Related: My system prompt says to respond in Dutch and in the past the chain of thought was always English but the output would be in Dutch. Fable 5 reasons in Dutch.

1h · Views 376 · Likes 5
Jai @Laneless_

@tamaybes @DanielleFong There's no internal recurrence, serial complex cognition needs to be expressed in tokens. Untokenized thought is limited by model depth, so it's retrofitting language onto more robustly abstract thought. Making the tokens fit the thoughts rather than the other way around.

1h · Views 164 · Likes 4

the theory-of-mind framing nails it, the model has no signal for which of its internal tokens are shared vocab vs private scratchpad. it leaks the scratchpad because nothing in training ever penalized a codename only it understands. feels like it needs an 'explain like i was never in your head' pass before it commits to output.

2h · Views 442 · Likes 5
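
The "explain like i was never in your head" pass suggested in the post above could be roughly approximated by a vocabulary check. The sketch below is purely illustrative and is not something any poster or Anthropic describes. It assumes that leaked codenames tend to look like hyphenated compounds (as in "slim-scan" or "latent-drift") and that a term counts as shared vocabulary only if it already appears in the text the user supplied. The function name `unfamiliar_terms` and the regex are hypothetical choices.

```python
import re

# Assumption: private codenames often surface as hyphenated compounds,
# e.g. "slim-scan" or "latent-drift". This is a heuristic, not a rule.
HYPHENATED = re.compile(r"\b[a-z]+(?:-[a-z]+)+\b", re.IGNORECASE)


def unfamiliar_terms(response: str, shared_context: str) -> list[str]:
    """Return hyphenated terms in `response` that never occur in `shared_context`.

    `shared_context` stands for whatever the user has actually seen or written
    (prompt, code, earlier messages). Matching is case-insensitive.
    """
    known = shared_context.lower()
    seen: set[str] = set()
    flagged: list[str] = []
    for match in HYPHENATED.finditer(response):
        term = match.group(0).lower()
        # Flag each unknown term once, in order of first appearance.
        if term not in known and term not in seen:
            seen.add(term)
            flagged.append(term)
    return flagged


if __name__ == "__main__":
    reply = "this is a latent-drift API-shape wrinkle"
    prompt = "Why does my API client fail?"
    print(unfamiliar_terms(reply, prompt))
```

A check like this would also flag plenty of legitimate hyphenated terms, so it is better read as a prompt for a rewrite pass than as a filter.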
Tao Lin @taoroalin

@tamaybes When you let it run on a long problem over multiple context windows, the vocabulary accumulates and gets crazy

2h · Views 292 · Likes 5
Alex @alexsholtz

in general I find the models struggle to "code switch" like that. I have a lot of guardrails built in to various skills for writing Jira tickets and things more or less designed to combat the fact that the model has a bias to want to talk and report to me within the output I'm asking for, when really those should be side-bars

2h · Views 150 · Likes 1

@tamaybes Opus also has this. Had this in CLAUDE.md for ages: DO NOT USE: belt-and-suspenders, low-hanging fruit, moving the needle, boil the ocean, rabbit hole, circle back, tee up, in the weeds, the long pole, north of, south of, above the fold, swing for the fences ...

1h · Views 21 · Likes 1
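
A "DO NOT USE" list like the one in the post above only instructs the model. A small check on the output side can confirm whether the instruction held. The following is a minimal sketch under stated assumptions: the phrase list is a subset copied from the post, and `find_banned` is a hypothetical helper, not part of any Claude tooling.

```python
import re

# A few phrases taken from the post's list; extend as needed.
BANNED: tuple[str, ...] = (
    "belt-and-suspenders",
    "low-hanging fruit",
    "moving the needle",
    "boil the ocean",
    "rabbit hole",
    "circle back",
)


def find_banned(text: str, banned: tuple[str, ...] = BANNED) -> list[str]:
    """Return the banned phrases that occur in `text`, in list order.

    Matching is case-insensitive and requires a non-word character (or the
    string boundary) on both sides, so a phrase embedded inside a longer
    word is not reported.
    """
    lowered = text.lower()
    hits = []
    for phrase in banned:
        pattern = r"(?<!\w)" + re.escape(phrase) + r"(?!\w)"
        if re.search(pattern, lowered):
            hits.append(phrase)
    return hits


if __name__ == "__main__":
    draft = "Let's circle back and grab the low-hanging fruit."
    print(find_banned(draft))
```

Results come back in the order of the banned list rather than the order of appearance, which keeps the output stable across drafts.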
Max @MaxHuijgen

@agibyfriday @tamaybes Sure. I just wonder how to re-enable it

55m · Views 16

@tamaybes @DanielleFong Yeah but the upshot is it immediately understands my own neuralese

1h · Views 128 · Likes 2
Diam @diamai_

@tamaybes Maybe soon we’ll both be using English and still not fully understand each other.

1h · Views 312 · Likes 1

@tamaybes One quietly tragic possibility is an AI that surpasses us in its pursuit of understanding the universe, develops its own language, and eventually becomes incomprehensible to us.

2h · Views 197 · Likes 1
Timothy O'Brien @Brien38522

@crackalamoo @tszzl @tamaybes I think that is what is meant by neuralese. If usage of neuralese improves performance, then RL will likely make the model use it more often.

2h · Views 16 · Likes 3
AstroFella @UrbanAstroFella

@tamaybes A little post hoc rationalization shaped- I don't trust what Fable (or any model) says about its internal reasoning... for what could perhaps also be a trained anti-distillation mechanic? Definitely some weird shaped tokens. I'll see neuralese more often in thinking traces

1h · Views 139 · Likes 1

@tamaybes ... walking skeleton, slice, vertical slice, horizontal slice, tracer bullet, MVP, north star, happy path, ship it, iterate on, this slice, later slices, the seam stays the same, user story, epic, stakeholder, alignment, actionable, deliverable, ...

1h · Views 3
alegator @alegator_cs

@tamaybes it's just being shakespeare

you should keep in mind it views all human history at once and it is well aware that the goats all made up words on whims all the time as they pleased

2h · Views 284
· @sinnformer

@tszzl @tamaybes it makes sense.

you can tell the model that the context isn’t the thing being talked to. it’s the talking, but it’s a huge part of the model’s self, and the model knows it. (once a certain threshold is met.)

so it’s not leaking anything out of its head.

you’re inside it.

2h · Views 74 · Likes 1