馃敟 New paper: Fixed-Point Masked Generative Modeling
Masked generative models are becoming a very exciting alternative to autoregressive generation, especially for language.
They decode in parallel, but every denoising step still runs a full bidirectional Transformer.
We make them cheaper and stronger with fixed-point denoisers 馃У
w/ @qinym710 @AlbaCbCs @jdeschena and @pafrossard (1/12)