AI · 8h ago

LisanBench creator Lisan al Gaib scores ~4,000 company mentions in his own tweets, finding an average sentiment of 6.63 for Anthropic versus 4.89 for OpenAI

His sentiment toward OpenAI slid from the GPT-5 release through GPT-5.3, with GPT-5.5 the most positive OpenAI launch in the dataset so far

Original post
Lisan al Gaib@scaling01 · #975 in AI

I analyzed the sentiment of every tweet I’ve posted about OpenAI and Anthropic.

I had Codex score ~4,000 company mentions from 0-10, where 0 is extremely negative and 10 is extremely positive.

Average scores: OpenAI: 4.89 Anthropic: 6.63

For Anthropic, my most negative stretch was between Claude 4 and Claude 4.1. There’s a clear uptick around Opus 4.5, then a slight uplift by Mythos and then massive crash with Opus 4.7.

For OpenAI, sentiment starts sliding hard from GPT-5 through GPT-5.3, probably because of the token-maxxing era. There’s also a sharp crash right before GPT-5.4, likely the supply-chain-risk backstabbing episode.

GPT-5.5 looks like the most positive OpenAI launch in the dataset so far.

6:04 AM · Jun 6, 2026 · 14.3K Views
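
The scoring-and-averaging step described in the post can be sketched roughly as follows. This is a minimal illustration, not the author's pipeline: the rows are made-up toy values, and `average_by_company` and `rolling_mean` are hypothetical helper names. It assumes each mention has already been given a 0-10 score by a model.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical scored mentions: (date, company, score from 0 to 10).
# Toy values for illustration only, NOT the author's data.
mentions = [
    ("2026-01-05", "OpenAI", 4),
    ("2026-01-09", "Anthropic", 7),
    ("2026-02-11", "OpenAI", 6),
    ("2026-02-20", "Anthropic", 6),
    ("2026-03-02", "OpenAI", 5),
    ("2026-03-15", "Anthropic", 8),
]


def average_by_company(rows):
    """Group scores by company and return the mean score for each."""
    buckets = defaultdict(list)
    for _date, company, score in rows:
        buckets[company].append(score)
    return {company: mean(scores) for company, scores in buckets.items()}


def rolling_mean(scores, window):
    """Trailing moving average; early points use however many values exist."""
    smoothed = []
    for i in range(len(scores)):
        chunk = scores[max(0, i - window + 1): i + 1]
        smoothed.append(mean(chunk))
    return smoothed


averages = average_by_company(mentions)
openai_trend = rolling_mean([s for _d, c, s in mentions if c == "OpenAI"], window=2)
```

A wider `window` gives a smoother curve, which is presumably what "different smoothing" changes between plots of the same data.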
Sentiment

Many commenters welcomed the analysis ranking Anthropic above OpenAI, with some crediting Anthropic's better communication and expectation management, while others dismissed the comparison as astrology-like or insulted the poster.

Pos 70.8% · Neg 29.2% · 13 comments with sentiment.
Cluster Engagement
Posts from X
Most Activity: Views 4.9K · Likes 61 · Retweets 2 · Replies 5
Lisan al Gaib@scaling01

With the current slope of my sentiment since the launch of GPT-5.3-Codex, I will turn more bullish on OpenAI than Anthropic on September 30, 2026

(incredibly realistic forecast)

1h · Views 4.9K · Likes 61 · Bookmarks 2
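
The tongue-in-cheek forecast amounts to extending two trend lines until they cross. A minimal sketch of that idea, assuming an ordinary least-squares line per company; the numbers are toy values and `fit_line` and `crossover` are hypothetical names, so this does not reproduce the September 30 date:

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit; returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def crossover(line_a, line_b):
    """x where two (slope, intercept) lines meet, or None if parallel."""
    (slope_a, icpt_a), (slope_b, icpt_b) = line_a, line_b
    if slope_a == slope_b:
        return None
    return (icpt_b - icpt_a) / (slope_a - slope_b)


# Toy data: days since some reference launch vs. smoothed sentiment.
days = [0, 10, 20, 30]
openai_scores = [4.0, 4.5, 5.0, 5.5]      # rising trend (made up)
anthropic_scores = [6.0, 6.0, 6.0, 6.0]   # flat trend (made up)

cross_day = crossover(fit_line(days, openai_scores),
                      fit_line(days, anthropic_scores))
```

As one reply points out, a straight-line extrapolation is fragile when the metric swings noticeably on a single release cycle.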
Lisan al Gaib@scaling01

if you are not doing technical analysis on your own sentiment then what are you even doing

53m · Views 2.5K · Likes 39 · Bookmarks 6
jason@jxnlco

@scaling01 Can you plot the relative ratio between them over time

3h · Views 1.2K · Likes 7 · Bookmarks 1
Lisan al Gaib@scaling01

@jxnlco same plot just different smoothing

2h · Views 473 · Likes 3 · Bookmarks 1
Lisan al Gaib@scaling01

i should have also included notable dates for political drama

3h · Views 1.6K · Likes 4 · Bookmarks 0
Lisan al Gaib@scaling01

even image gen is bullish on Anthropic

51m · Views 969 · Likes 3 · Bookmarks 0
A fierce pancake@SayItLoud19

@scaling01 You need to get out more.

8h · Views 186 · Likes 1
μck ٤:@JustMicrock

@scaling01 how did you do it? just fed all your posts to an agent?

2h · Views 13 · Likes 1
Lisan al Gaib@scaling01

and normalized by my overall mood

3h · Views 627
Sentio@Sentio_xbt

@scaling01 Anthropic consistently scores higher in sentiment across launches

The crashes during key releases suggest that user expectations are tightly linked to performance promises, and when those aren't met, sentiment drops sharply

8h · Views 288
Alex YGift@Radipdegen

@scaling01 scored every single mention but the real metric is how often u typed each one

6.63 for anthropic is generous tbh

8h · Views 218
Deepak K@deepakThamizhK

@scaling01 What happened in that Claude 3→4 stretch that tanked your score? gap between 4.8 and 6.63 is wild most people don't quantify their ai bias, you actually did.

7h · Views 184
hasanfr@hasanfr_0rg

@scaling01 this one?

8h · Views 176
haro@harobuilds

@scaling01 the ratio has been compressing since codex launched but "september 30" is doing a lot of heavy lifting for a metric that swings 0.2x on a single release cycle

52m · Views 48 · Likes 1
先手 · Ahead@yangyue992125

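@scaling01 Having Codex score the mentions of OpenAI and Anthropic is something in itself.

It amounts to asking OpenAI's own model to be the judge, and OpenAI still only got 4.89, about 1.7 points lower than Anthropic.

If even its own model couldn't bail out its own company, that shows your tweets really were negative, not that the scoring was biased.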

7h · Views 152
Rugbist@rugbist_

@scaling01 kinda wild that an honest sentiment breakdown puts them that far apart.

what were the Anthropic low points about?

8h · Views 111
JMoon@Jmoon_174

@scaling01 Anthropic scoring higher in your own writing says as much about how each company communicates as it does about the models. Anthropic has been better at managing expectations.

7h · Views 70
ClarityChaser@clarity_chaser

@scaling01 You do know there's this thing called grass; it's quite pleasant to sit in.

4h · Views 60