> but this would mean the margin on these models is CRAZY compared to oss models
…yes, it is Furthermore, given that they have vastly better hardware than the Chinese, but WE don't get 300 t/s on GPT 5.5, they must serve this crap with insane batch sizes it's a money machine
i have a hard time believing this. at the same time it would make sense that the CEO of cursor know the size of closed models, but this would mean the margin on these models is CRAZY compared to oss models.
this plot doesn't even account for the fact that serving at scale reduces cost (much higher batch, better hardware) and that they have many more engineers working on optimizing the inference stack ect.. (this is a log-log plot btw)
