@torchcompiled It isn't useless and it isn't crazy
Any maximum likelihood method has a mixture of deltas solution, that one is actually useless
It’s crazy that the diffusion model has a closed-form solution that’s effectively useless and only capable of reproducing exact memorizations.
It’s the imperfection that lets them work, “failing gracefully” if you will
