SemiAnalysis posted data from 174,264 agentic coding sessions showing 42% of runtime on CPU tasks versus 58% on GPU inference and highlighted cloud pricing mismatches with per-token monetization
Median per-turn time measured 5.13 seconds.
——0——
QUOTE POST
#687Bojan Tunguz@TUNGUZ
Very important.
FACT ALERT 🚨 : In modern agentic coding, 42% of the time is spent on CPU doing tool use such as editing files, running Bash scripts, running lints, etc. The economy of traditional cloud computing charges at $ per cpu core. In the economy of agents, the business model is $ per token thus to increase token revenue, you need to increase the amount of CPUs power u have so that you can generate your tokens.
2:00 PM · May 23, 2026 · 31.8K Views
3:03 PM · May 23, 2026 · 2.6K Views