/Tech21h ago

Users Struggle to Judge New AI Models as Experts Spot Singularity Signs

3345.2K216660462.1K

#161

Original post

Ethan Perez#161

Citrini@citrini

I think we’ve reached the point where normal people can’t really determine whether new models are better than previous ones. Like Fable doesn’t seem that much better to me, but every 150 IQ person I know is like “wow the singularity came sooner than I thought”.

3:51 PM · Jun 9, 2026 · 462.1K Views

/Tech21h ago

Users Struggle to Judge New AI Models as Experts Spot Singularity Signs

3345.2K216660462.1K

#161

Original post

Ethan Perez#161

Citrini@citrini

3:51 PM · Jun 9, 2026 · 462.1K Views

Sentiment

Many users dismissed claims about new AI models and singularity signs as meaningless hype or self-soothing, while some praised the models for being faster and more capable.

Pos

23.8%

Neg

76.2%

21 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

VIEWS9.4KBOOKMARKS42RETWEETS7

Luke Martin@VentureCoinist

@citrini "To appreciate talent you have to be within a standard deviation of that talent"

20h9.4K10342

LIKES103REPLIES8

Citrini@citrini

@stevehou I’m talking about engineers and people who are working on serious problems vs just finance bros trying to securitize new shit

17h8.8K1032

goodalexander@goodalexander

@citrini @VentureCoinist I have a harness that makes a plan and tries to improve it iteratively until the plan cannot easily be improved

Usually you hit a top score where iterations become noise. Fable can break the previous ceiling pretty aggressively

Have to get used to being a meat puppet

20h5.7K9026

Steve Hou@stevehou

@citrini How many "150 IQ person"s do you know?

21h4.1K75

Adam Cochran (adamscochran.eth)@adamscochran

It’s contextual.

Fable when asking a normal question via web or mobile? Very minor improvement not noticeable.

Fable when in IDE code environment and poking through your code? Takes initiative when finding issues, solves it, validates with testing without being asked to ensure no errors.

Noticeable difference in its vertical knowledge and attention to detail

21h2.8K184

Forward Looking (Into the Abyss)@DarkPoolTA

@citrini LLMs need to integrate timestamps on user messages to better establish context and intuit linear time. I have stopped using LLMs for the most part because having to constantly clarify “today vs yesterday” is maddening.

21h5.5K391

dani@absenteewarlord

@citrini this is a bear case for the labs themselves. real-world tasks getting saturated by the frontier, and china will catch up in a few months. hard for customers to justify buying SOTA and hard for the labs to justify selling anything but SOTA.

21h937152

Mike Taylor@hammer_mt

@citrini It's better at more substantial or open ended tasks, and it's not actually much better at smaller tasks. It's only when you run it in a loop with large context that you notice the difference.

20h87312

Ben Cohen@blc_16

@citrini 150 IQ or middle aged token addict?

20h87714

Steve Hou@stevehou

@citrini Am I one of them?

17h1.7K4

MetaCritic Capital@MetacriticCap

@citrini Lol

21h4718

David@David97717063

@citrini I told fable to fix a bug by looking at the source code and instead of just reading the code which opus 4.7 4.8 would do, it wrote instrumentation code and ran the app.

20h1.7K1