/Tech2h ago

Prime Intellect's Florian Brand says OpenAI's Codex generates code that is either entirely correct or obviously flawed

His evaluations heavily favor GPT models over Anthropic's Claude.

133111.8K

#487

Original post

Florian Brand@xeophon#1190inTech

with codex, i feel like the message is either correct or so bad that it’s very obvious if you know what you are doing.

this could be me being more used to gpt, I haven’t used Claude in ~7-8 months as my main model now

Florian Brand@xeophon

spent the day with fable on a bunch of random stuff and it’s very spiky, imo.

it is brilliant in the same message where it makes 1-3 severe mistakes, which means you have to check even more stuff more in-depth, wasting time and tokens

8:20 AM · Jun 11, 2026 · 560 Views

/Tech2h ago

Prime Intellect's Florian Brand says OpenAI's Codex generates code that is either entirely correct or obviously flawed

His evaluations heavily favor GPT models over Anthropic's Claude.

133111.8K

#487

Original post

Florian Brand@xeophon#1190inTech

with codex, i feel like the message is either correct or so bad that it’s very obvious if you know what you are doing.

this could be me being more used to gpt, I haven’t used Claude in ~7-8 months as my main model now

Florian Brand@xeophon

spent the day with fable on a bunch of random stuff and it’s very spiky, imo.

it is brilliant in the same message where it makes 1-3 severe mistakes, which means you have to check even more stuff more in-depth, wasting time and tokens

8:20 AM · Jun 11, 2026 · 560 Views

Sentiment

Sentiment building, check back later.

Cluster Engagement

Posts from X

Most Activity

VIEWS43BOOKMARKS1LIKES2

ueaj@_ueaj

@xeophon Empathy is an extremely underrated skill in the agentic era for this reason. I have a very good sense of when Claude will or has gotten something wrong. I code entirely in CC with no editor/IDE and I can just feel when it's done smth stupid

1h4321

RETWEETS1

Florian Brand@xeophon

@dejavucoder well there’s no pro in codex, but pro is an insanely good model that barely makes mistakes

2h1.3K151