1d ago

ZyphraAI converts ZAYA1-8B-base model to diffusion LLM

28806952691.1M

——0——

ZyphraAI converted its ZAYA1-8B-base autoregressive model to a diffusion LLM through mid-training rather than training from scratch. The company applied a TiDAR-based diffusion-conversion step followed by diffusion supervised fine-tuning on its existing stack. The resulting model diffuses 16-token blocks in a single step from a mask prior, matches autoregressive logits via speculative decoding, and mixes logits during sampling to deliver speedups over earlier methods such as TiDAR. Platform discussion noted diffusion language models increasingly adopting autoregressive traits on smaller sequential token blocks.

Original post

#999@ANDERSONBCDEFG @ZYPHRAAI

Zyphra@ZYPHRAAI

We present ZAYA1-8B-Diffusion-Preview, the first diffusion language model trained on @AMD. Autoregressive LLMs generate one token at a time; diffusion generates a block in parallel, speeding up inference. We show a 4.6-7.7x decoding speedup with minimal quality degradation 🧵

2:33 PM · May 14, 2026

Cluster engagement

3 snapshots

Reposted by

#999@ANDERSONBCDEFG

#158@EMOSTAQUE

QUOTE POST

#400Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@TEORTAXESTEX

Yuxi keeps winning

9:37 PM · May 14, 2026 · 5.9K Views

QUOTE POST

#400Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@TEORTAXESTEX

That's a cool way to explain the logistical salience of the trick, but let's be honest, such "diffusion endpoint" is just the logical endpoint of speculative decoding for AR. Pure diffusion language models had another… more confused promise.

11:43 PM · May 14, 2026 · 2.5K Views

#841kalomaze@KALOMAZE

@teortaxesTex >ask the performant dlm author if its seqlevel denoising or block-causal >"its a good diffusion language model, sir" >look inside >its block-causal

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

Yuxi keeps winning

9:37 PM · May 14, 2026 · 5.9K Views

10:18 PM · May 14, 2026 · 704 Views

QUOTE POST

#913Anirudh Goyal@ANIRUDHG9119

Cool.

Zyphra@ZyphraAI

9:33 PM · May 14, 2026 · 1.1M Views

10:26 PM · May 14, 2026 · 940 Views