Mind-blowing hardware breakthrough:
An open source garage engineer burned a full AI Transformer model (with KV cache) directly into a custom digital chip:
WITH NO GPU, NO CPU, NO CLOUD.
Just pure silicon running microGPT at 56,000+ tokens/sec on only 80 MHz!
And uses less energy than a calculator.
Prototyped on FPGA, now spelling names on a tiny LCD. This is GateGPT and a big future of on-device AI is here.
This can and will scale to far larger models.
Insane efficiency. Pure digital magic.
56,000+ tokens/sec at just 80 MHz. 🤯
I burned a full Transformer with KV cache into a custom chip. Designed gate by gate as a 100% digital integrated circuit. Prototyped on a FPGA. (No GPU. No CPU) Just pure digital silicon running @karpathy microGPT, spelling out names on a tiny LCD.
This is GateGPT 👇

















