/Tech8h ago

Kim Isenberg argues no Western text-to-video AI model currently matches ByteDance's Seedance 2.0

Story Overview

ByteDance keeps widening its lead in text-to-video generation with Seedance 2.5, now rolling out through CapCut and Dreamina with longer continuous clips, richer multimodal controls, and stronger consistency than anything currently shipping from Western labs.

45351287934.2K

#353

Original post

Chubby♨️@kimmonismus#1364inTech

Still, no Western text-to-video model comes close to Seedance 2.0, and Seedance 2.5 is already ready.

There are certainly several explanations for this. One of them is that it is often at least claimed that this is because Seedance has access to such a vast amount of video material and does not take copyright protection all that seriously. That is a vague assumption, and honestly, I cannot imagine it being the only reason. Google, in turn, has YouTube, a platform with countless videos that could surely be used to train good models. Just remember when Mira Murati was asked how they had trained Sora and whether YouTube videos had been used for it.

Be that as it may, the more questionable issue is why there seems to be so little interest and focus on video models. My assumption is that they are simply not relevant. They are basically a nice gimmick, but currently negligible in the race for the best models. More specifically, the focus on LLMs, which are making outstanding progress in important areas such as SWE, is simply so much more important for winning overall that one would not use compute for video models instead. OpenAI is known to have completely ended Sora for the moment.

Maybe the more important point is that consumer video is probably not the real endgame for AI video models. Yes, they are useful for creators, ads, short-form content and entertainment, and for ByteDance this obviously fits perfectly into CapCut, Dreamina and TikTok. But strategically, the bigger reason to train these systems may be that video is one of the richest training signals we have for learning the dynamics of the physical world: motion, causality, object permanence, spatial consistency and interaction. In that sense, video models are not just content generators, but early world models (Google, NVIDIA). Or in short: for Western labs, AI video segments for the consumer sector are too cost-inefficient with too little real benefit.

That is why I think we are currently seeing hardly any change in this area.

CapCut@capcutapp

Coming soon: Dreamina Seedance 2.5 is arriving on CapCut.

Seamless generation and editing. Up to 50 multimodal references. 30-second scenes in one shot. Finer creative control. More reliable results. It's built to make creating faster, smoother, and more intuitive.

Whether you're creating animations, short dramas, social content, marketing videos or something entirely new, the next generation of AI video creation is almost here.

And it's coming to CapCut across Web, Desktop, and Mobile.

Stay tuned.

2:22 AM · Jul 4, 2026 · 34.2K Views

Industry Shift

Western labs chase bigger strategic bets instead

Major players appear to be funneling resources toward large language models that unlock software engineering and agentic tasks, leaving video generation on the back burner for the moment.

Open Question

A competitive Western release is already on deck

Nathan Benaich notes that at least one strong alternative is slated to arrive soon, though exact timing and performance details remain unclear.

Sentiment

Positive users praise Seedance 2.5's open-source model and China's fast lead in video AI while negative users criticize Western labs' slow iteration and bad pricing.

Pos

80.0%

Neg

20.0%

11 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

VIEWS906REPLIES1

Nathan Benaich@nathanbenaich

something western is coming soon and it’s v good

Chubby♨️@kimmonismus

Still, no Western text-to-video model comes close to Seedance 2.0, and Seedance 2.5 is already ready.

That is why I think we are currently seeing hardly any change in this area.

1h90600

BOOKMARKS1

pink@iJaadee

@kimmonismus I am sooo hoping @ViduAI_official gets back in the game. Because they HAD IT sooo close and I was thinking that with version four, they would finally have a true masterpiece

6h7031

LIKES4

yves@yvesai0

@kimmonismus You say: “ They are basically a nice gimmick” And then later you say they are needed for learning physics and world understanding (which I agree with so I think video models are actually extremely important) So which is it because this contradicts your whole post

7h2314

Paco@Pacoxbt

@kimmonismus world model angle is underrated

8h5983

Chubby♨️@kimmonismus

@sunglassesface thank you. I think the really important aspect is that the consumer sector is simply negligible in the race for AGI and RSI. And computing power is needed differently.

8h6042

orlie@sunglassesface

@kimmonismus 100% agreed with the copyright issues in the west and enjoy the take on the fact that the endgame is different

8h5821

Rey@ReyArtAge

@kimmonismus Oh Americans are so great , they respect the law and privacy . China bad , does not respect laws. Anyone who has this thought is stupid. Recognize that China has become a power worth of respect and deal with it. As a country, with opensource they are making everyone better.

6h1482

Vito@corleonecapital

@kimmonismus Google has no focus. The Jack of all trades and the master of none atm. Omni is dogshit for making videos. What’s its good for idk.

4h84

Todd Howard(恶俗bot)@NinePoints8

@kimmonismus 似乎sora没能找到合适的赛道，而seedance找到了，他现在每个月能赚8000万美元，而且利润率在70%

4h33

baroque obama@baroqueobama87

@corleonecapital @kimmonismus They have resources to spray and pray (unlike Anthropic who have been dialed into just text models), so don’t fault them for trying everything. But at a higher level it doesn’t connect together cohesively.

32m81

Vito@corleonecapital

@baroqueobama87 @kimmonismus True. Very frustrating. Wish they’d pick 3 focus areas and dominate instead of whatever the hell it is they’re doing.

Making my 1st AI film with Seedance, and can’t help but wish I could use American software. What a missed opportunity to democratize American story telling

39m10

Vito@corleonecapital

Yeah. My feeling is, they can afford to fall behind on world and video models, bc they have a data advantage and can catch up or leapfrog fast

Coding is different. Anthropic and OpenAI, and not cursor and Xai, have differentiated data loops. Very frustrating they don’t put their resources behind that effort in full. If they don’t catch up soon in that area it could be bad.

I feel you though. Hindsight is 20/20. Their strategy was reasonable. Now it seems questionable

22m7

Chubby♨️@kimmonismus

@Pacoxbt Especially with regard to robotics, World Models are not to be underestimated. But they are irrelevant for consumers.

8h654

NTK AI@NtokozoAI

That would be genuinely exciting.

I watched a Higgsfield Seedance 2.0 4K workflow where the snow leopard shot looked absurdly real, and that still was not 2.5!

If Western labs can match or beat that level of quality, it would be genuinely impressive.

The quality bar is already very high. The harder race may not be quality alone. It may be production economics.

A one hour film is 240 fifteen second clips. At four regeneration iterations per final clip, Seedance 2.0 4K would need about 316,800 credits on Higgsfield, roughly $15k to $20k at current rates.

That is ridiculously expensive for most creators, which means long form content creation with this tool still belongs mostly to very wealthy creators or movie studios.

The plans top out around 9,000 credits a month. So the next breakthrough is not just better video, it is making the iteration loop scale.

42m4

Hamza Khalid@humzaakhalid

@kimmonismus video was always training the world model

8h373

Cheehk@Cheehk2

@kimmonismus The possibility is high the frontier closed-models copied and benefiting from the open source models. The evidence is that US labs do not have a frontier video generation model due to the fact that Seedance or Kling do not open their models.

3h821

Ivana@ivanainai

@kimmonismus The data explanation feels almost too easy but nobody's offered a better one 🤷‍♀️

7h217

Oprèlia AI@OpreliaAI

@kimmonismus You're 98% right, but hopium is the best drug you can sell to ppl, the idea some aspiring filmmaker can access to seedance and will spend hundreds of dollars to achieve it, is exactly the customer base they want. Studios will spend credits yes, but not as much.

6h185

orlie@sunglassesface

@kimmonismus I wonder what the Chinese worldmodels look like now

7h551

Hussain Hashim | Building SundayBack@itsthedonhashim

@kimmonismus @kimmonismus maybe it's just a data thing, but honestly wonder if there's also some secret sauce in their algorithms 🤔

6h441