Though the project itself is about the transparency of DiffusionGemma, I'm most excited about this as an example of what it could look like to assess the transparency of latent reasoning models, in a manner that lets us compare to autoregressive CoT
Text diffusion models are fast, but are less transparent than today's LLMs because they do many forward passes before outputting text.
We audit the transparency of DiffusionGemma and find that the intermediates are interpretable. This recovers many of the benefits of CoT!
🧵