Cerebras runs trillion-parameter OpenAI models internally
Cerebras runs trillion-parameter models developed by OpenAI on its dedicated inference hardware. The systems currently serve versions 5.4 and 5.5, exclusively for OpenAI's internal use. Company leadership says the infrastructure can accommodate models of any size, with no upper limit.
@zephyr_z9 There could be specific demand for a fast/costly model, but yeah, it's a harder bet if you lock into specialized hardware.
"We're serving trillion parameter models that are internal for OpenAI today. We are currently running OpenAI 5.4 and 5.5 with them" GPT 5.4/5.5 running internally to accelerate R&D and product development for OpenAI is very different from it being commercially viable for external inference at a trillion parameter scale
"We're serving trillion parameter models that are internal for OpenAI today. We are currently running OpenAI 5.4 and 5.5 with them"
GPT 5.4/5.5 running internally to accelerate OpenAI's R&D and product development is very different from trillion-parameter inference being commercially viable for external customers.
Cerebras CFO: "We serve all models, and there is no limit to the size of the models that we can serve. Today, we're serving trillion parameter models. We're serving trillion parameter models that are internal for OpenAI today. We are currently running OpenAI 5.4 and 5.5 with them."