/Tech2h ago

Zhonghao Xinying, founded by a former Google TPU engineer, launches a chip delivering 896 TFLOPS at 600W

The chip natively supports PyTorch, vLLM, and SGLang.

118036.2K

#530

Original post

Zephyr@zephyr_z9#1700inTech

Hmmm... Interesting

tphuang@tphuang

中昊芯英 unveils next gen TPU AI chip as part of 泰则 2.0 compute cluster that will have 896 TFLOPS compute per chip (1792 TOPS INT8). Each chip uses just 600W of power.

In a 8 TPU + 2 CPU setup, box will have 7.168 PFLOPS of compute. Natively supports PyTorch, vLLM, SGLang & other tools.

Completed integration w/ Qwen, DeepSeek, GLM & Minimax.

Company uses Chiplet + 2.5D packaging & connected in a cluster via optical modules & w/ OCS.

Founder/CEO 杨龚轶凡 was a core member of Google TPU design team for v2/3/4.

This does feel like the Chinese Google TPU.

6:33 PM · Jun 30, 2026 · 5.6K Views

Sentiment

Sentiment building, check back later.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

VIEWS636LIKES3

Beff (e/acc)@beffjezos

"Founder is ex Google TPU team"