/Tech2h ago

Prime Intellect's Florian Brand says OpenAI's Codex generates code that is either entirely correct or obviously flawed

His evaluations heavily favor GPT models over Anthropic's Claude.

133111.8K
Original post
Florian Brand@xeophon#1190inTech

with codex, i feel like the message is either correct or so bad that it’s very obvious if you know what you are doing.

this could be me being more used to gpt, I haven’t used Claude in ~7-8 months as my main model now

spent the day with fable on a bunch of random stuff and it’s very spiky, imo.

it is brilliant in the same message where it makes 1-3 severe mistakes, which means you have to check even more stuff more in-depth, wasting time and tokens

8:20 AM · Jun 11, 2026 · 560 Views
Sentiment
Sentiment building, check back later.
Cluster Engagement
Posts from X
Most Activity
Most Activity
VIEWS43BOOKMARKS1LIKES2
ueaj@_ueaj

@xeophon Empathy is an extremely underrated skill in the agentic era for this reason. I have a very good sense of when Claude will or has gotten something wrong. I code entirely in CC with no editor/IDE and I can just feel when it's done smth stupid

1hViews 43Likes 2Bookmarks 1
RETWEETS1

@dejavucoder well there’s no pro in codex, but pro is an insanely good model that barely makes mistakes

2hViews 1.3KLikes 15Bookmarks 1