Tech · 3h ago

Simo Ryu, Stable Diffusion LoRA creator, claims US frontier AI models lead Chinese models by two years

Yaroslav Bulatov challenged the estimate using open-source model comparisons.

Original post

Simo Ryu@cloneofsimo

Hot take: I speculate US frontier models are at least 2 years ahead compared to Chinese frontier models.

2:34 PM · Jun 14, 2026 · 10.1K Views

Sentiment

Many users rejected the speculation that US frontier models lead Chinese AI by at least two years, dismissing it as outdated, untrue, or based on insufficient experience with Chinese models.

Positive: 12.5%

Negative: 87.5%

10 comments with sentiment.

Cluster Engagement

Posts from X

Views: 901 · Bookmarks: 1 · Likes: 10 · Replies: 3

Simo Ryu@cloneofsimo

Yes, I understand that benchmark-wise Chinese models are only 6 months behind. But I am saying it will take them 2 years to cross the gap, because US models do not distill from other, better models.

In other words, I think Chinese models would be 2 years behind had they not used the output of US models.

POM@peterom

@cloneofsimo This tells me that you haven't meaningfully used Chinese frontier models

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

@cloneofsimo is this related to recent reports about Korean LLM companies defrauding the govt

Charuru Charuru@CharuruCha14310

@cloneofsimo Uh did you see glm 5.2? It's def ahead of opus 4.5 and probably gpt5.3, maybe others as well, but def ahead of grok 4.3 and gemini 3.1.

ShitCockaSays@batcz

you're talking past each other: peak capabilities are much closer than 2 years, but they feel worse because they generalize differently/worse.

China is starting to catch up on pre-training so they have more of that "big model smell", and starting to move away from distillation as a driver for reasoning gains.

(distilling from reasoning traces without logits makes models feel awful out of distribution. like actually brain damaged.)

Yaroslav Bulatov@yaroslavvb

@cloneofsimo What about the reverse? What past frontier models are current frontier OSS models ahead of? Two year gap would put existing OSS frontier at the level of 3.5 sonnet and gpt-4o

Liu Liu@liuliu

@cloneofsimo But this hypothetical doesn’t guide anything? They will distill, and find a way to distill. It is like claiming that without OpenAI, the whole LLM revolution would be 3~4 years late. Probably true but not useful?

JMB 🧙‍♂️@jmbollenbacher

@peterom @cloneofsimo or that he forgets how weak models were two years ago.

probably both. people tend to forget how fast things are going, and china hawks tend not to use chinese models.

jimbo@jimboiwe

@cloneofsimo they're not even comparable, anthropic is alone

chinese models are unusable when you give them real agentic work

ShitCockaSays@batcz

like I said, they're just now solving two of the biggest factors: M3 to me is probably the best sign of the pre-training progress so far, and K2.7 is the best sign of the post-training improvements...

but no one Chinese model is really the best of all that recent progress combined. I expect it'll be another generation before it all comes together fully.

POM@peterom

@batcz @cloneofsimo What you're saying was true until recently, recent models generalise as well as e.g. Opus 4.5 in my extensive experience.
