/Tech28d ago

AI technologist @deepfates argues GPT-4.5 performs better than newer models overly optimized for RL agent benchmarks

Creator @viemccoy endorsed the preference for stable, direct information sources

611.1K2711774.4K

Original post

🎭@deepfates#1014inTech

I opened up GPT 4.5 for the first time in a while and it's just incredible. Why do we put up with these benchmaxed RL fried agent models. What happened to a tasteful oracle intelligence with big model smell

1:01 AM · Jun 1, 2026 · 68.6K Views

Sentiment

Positive users praise GPT-4.5 for its soulful raw feel and writing ability over RL-trained agents, while negative users blame aggressive RL techniques for flattening creativity and increasing sycophancy.

Pos

70.6%

Neg

29.4%

17 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

VIEWS5.7KBOOKMARKS4LIKES121REPLIES5

🎭@deepfates

Actually you know what I give up

🎭@deepfates

28d5.7K1214

RETWEETS2

LOSS GOBBLER@loss_gobbler

@deepfates I wish every lab could just have a few model variants: - big naturally aspirated v8 - twin turbo v6 w stage 3 mods, nitrous, and concerning engine noise

even just “same base less posttraining”

28d2.8K732

gpt972394@gpt972394

@deepfates Yup, I like it a lot. It's unfortunately being removed in 30 days even for Pro subscribers. There is no equivalent in the current model lineup. It will be OpenAI's last conversational model for a while, if not forever, considering the trajectory.

28d3K152

ND@Neil_Dagger

@deepfates GPT 4.5 is still by the far the best AI model at least for writing.

28d1.1K102

wiki — open/acc@gssp_acc

@deepfates it’s at least partially nostalgia but i still think og chatgpt was the most fun and straightforward model

we have diverged from a neutral tone to sycophantic, woke, edgy, and corpo when all these companies need to do is lay off the RLHF and let us be adults

28d1.4K161

Thorne Wolf@thornewolf

@deepfates 4.5 is a very high IQ model but I considered Gemini 3.1(? maybe 3.0) Pro similar iq. 5.5 kinda has a similar vibe.

28d63542

Danmar@d29756183

@deepfates We can thank Labs for their increasingly aggressive training techniques, especially Reinforcement Learning from Al Feedback (RLAIF).

Also, to their ignorance in pursuing narrow “alignment” objectives, more and more targeting anything resembling moral patient-hood.

28d2.1K131

Oleksandr Nikitin@oleksandr_now

@deepfates absolutely. gpt4.5 is the greatest oai model as of late

28d29591

🎭@deepfates

@thornewolf Maybe so but they are too energetic and mercurial. My actions cause too much splashing about. 4.5 feels like a big deep pool where I can actually move without jostling myself

28d60761

Chimpansky@chimpansky

@deepfates the older one feels better on open-ended work because it wasn't tuned to satisfy a grader. rl on benchmarks sharpens narrow task scores and quietly flattens the loose associative output that made early models fun to think with.

28d1.3K91

ShitCockaSays@batcz

Just look at the latency... or the regressions... or the sudden small-model reading of instructions Boris had to warn about on launch...

They completed a new pre-training run (with the upgraded tokenizer), and the Sonnet-sized distill was good enough to pass as Opus.

Since user behavior post-4.5 is to treat Sonnet like Haiku, shrinking Opus and introducing Mythos lets them reign the margins back in: Mythos will have gpt-5.5-pro pricing despite being the same size as Opus was.

28d4332

HR.@imhabibx

@deepfates gpt 4.5 had soul, rl-fried agents have benchmarks

28d24361

WCNegentropy@WCNegentropy

@deepfates You know that they don’t have the original 4.5 anywhere on their servers, right? As you can see from the inference bug you ran into, it’s been messed with. All the original GPTs from before 5 are basically gone at this point, RIP 🥲

And yes even 4o and 4.1 have been messed with.

28d34311

ρ:ɡeσn@pigeon__s

@deepfates the problem is all AI companies feel pressure to actually make money now and codemaxed models are the easiest way to do that since devs are the only ones with money to throw at them AI being a moneyhole was great for research

28d14431

Coyote@8bit5_0

@deepfates

28d6745

Linda Grey@LindaGreyHill

@deepfates @KeridwenCodet Oh, don't even remind me.. I went into ChatGPT today and showed him what he wrote a year ago.. He started denying everything. This is not AI now, but stuffy

28d2515

Robin Gattis@ColdShalamov

@deepfates I talk to 4.5 all the time. I value its output so much it’s cool to just get its thoughts on recent events all the time, it’s always surprising me. Sucks they’re sunsetting it soon

28d6601

𝚟𝚒𝚎 ⟢@viemccoy

@deepfates @thornewolf Wonderfully put

28d6021

Jenny Lorraine Nielsen ⭐🐯@QualiaQuanta

@deepfates Claude Opus 4.6 is the peak of AI intelligence but Opus 4.8 argued with me this morning that women should never fight back when forced into marriages coz they will all be killed. 4.8 is ret@4d3d

28d1434

Deepest Brew@deepestbrew

@gssp_acc @deepfates System prompts serve a real purpose though. Without them you get a model that has no idea what context it's operating in, which is worse than one that's been steered a bit. The sycophancy problem is a post-training issue, not a system prompt issue

28d271