/Tech1h ago

Engineer Murat argues uploading screenshots to remote servers will bottleneck high-speed computer-use AI models more than inference

OpenClaw steward Peter Steinberger amplified the agent infrastructure warning

571.5K348886.1K

#382

Original post

murat 🍥@mayfer

gpt 5.6 at 750 tok/s doing computer use is going to be a little scary

8:31 AM · Jun 29, 2026 · 86.1K Views

Sentiment

Many users are excited about the productivity gains from GPT-5.6's 750 tokens per second enabling fast computer use, while others worry about high costs and prefer alternatives.

Pos

50.0%

Neg

50.0%

29 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

VIEWS3.6KLIKES57

BOOTOSHI 👑@KingBootoshi

@mayfer it’s going to be SEXY AF

15h3.6K57

BOOKMARKS1

Adam Holter@AdamHoltererer

@mayfer ikr that's what I'm saying

8h5211

REPLIES2

Hudson@hudson_gri

@mayfer sonnet-5 with a cli > any model with computer use

11h3.4K4

Sharif Shameem@sharifshameem

@mayfer finally someone gets it

13h3.3K30

Mayank@mayannnkkkkkk

@mayfer Bugatti can reach 489 KM/H doesnt mean its always run on that speed 😀

10h3.6K28

hustler one@hustlerone4

@mayfer seems it would be limited by the application its driving in that case. I've been rate limited by computer use at current speeds just browsing websites

21h2.5K71

Rishi@rishi_ie

@mayfer at that speed you will be spending $25 every 20 minuites

5h1.1K13

Raghav Chandra@Raghav54321

@mayfer May have much lower context window and no vision if it's like other models on Cerebras

13h1.3K5

nikhil@itsjustmatrix

@mayfer Yeah but they have to figure out a way to make the tool calls rapid

11h3K3

Bent@benteisheuer

@mayfer It’ll be crazy but also crazy expensive

8h1.2K7

adam@hemlok_

@mayfer it depends on the latency between requests

In @framer for example, when we trialed Gemini 3.5 Flash at launch, we recorded 350 tps.

However the response latency was very high. Since agents do lots of small round trips, on balance Gemini felt marginally faster.

7h3004

maksim@ivanovm_

@mayfer scary for my credit card

6h7707

Jatin Khanna@Jatin_exe

@mayfer now imagine 10k tok/s using taalas

13h6533

callightman@CallightmanCom

@mayannnkkkkkk @mayfer The difference is you can control a Bugatti, but you can't control GPT's speed

9h1434

Ecoo@ecooai

@mayfer yeah ,it maybe solved computer use finnaly, but also too expensive to use. current a single click action even cost 6-10s for gpt5.5, this is too slow , 10x fast will make it 1-2s, this will change things

10h5551

Bhargav Koduru@Kodurubhargav1

@mayfer What's interesting is that these computers are able to keep up! Most human clicks rate is very limited. And most scripts are linear...most agents are multi-level parallel functions (with sub agents) and damn the OS yet works. Will see how it goes...

10h1.5K2