16h ago

Researcher Releases PyTorch Implementation Of Gradient Moment Metric

0
Original post

Since I'm between jobs, I've been having a lot of fun vibe-coding with public tooling. First drop: a clean PyTorch impl of the Gradient Moment metric from our recent paper (arXiv:2603.20155). https://github.com/ehoogeboom/gradient-moment

6:49 AM · May 16, 2026 View on X

The motivation: for models without a tractable likelihood (distilled discrete diffusion, in our case), generative PPL is easy to game by sampling at low entropy. You get "better" PPL by being repetitive.

GM uses the gradient of a reference LM's NLL instead.

Emiel HoogeboomEmiel Hoogeboom@emiel_hoogeboom

Since I'm between jobs, I've been having a lot of fun vibe-coding with public tooling. First drop: a clean PyTorch impl of the Gradient Moment metric from our recent paper (arXiv:2603.20155). https://github.com/ehoogeboom/gradient-moment

1:57 PM · May 16, 2026 · 3.8K Views
1:57 PM · May 16, 2026 · 416 Views

Since I'm between jobs, I've been having a lot of fun vibe-coding with public tooling. First drop: a clean PyTorch impl of the Gradient Moment metric from our recent paper (arXiv:2603.20155). https://github.com/ehoogeboom/gradient-moment

1:49 PM · May 16, 2026 · 7 Views

The motivation: for models without a tractable likelihood (distilled discrete diffusion, in our case), generative PPL is easy to game by sampling at low entropy. You get "better" PPL by being more repetitive.

GM uses the gradient of a reference LM's NLL instead.

Emiel HoogeboomEmiel Hoogeboom@emiel_hoogeboom

Since I'm between jobs, I've been having a lot of fun vibe-coding with public tooling. First drop: a clean PyTorch impl of the Gradient Moment metric from our recent paper (arXiv:2603.20155). https://github.com/ehoogeboom/gradient-moment

1:49 PM · May 16, 2026 · 7 Views
1:49 PM · May 16, 2026 · 1 Views
Researcher Releases PyTorch Implementation Of Gradient Moment Metric · Digg