Since I'm between jobs, I've been having a lot of fun vibe-coding with public tooling.
First drop: a clean PyTorch impl of the Gradient Moment metric from our recent paper (arXiv:2603.20155). https://github.com/ehoogeboom/gradient-moment

The motivation: for models without a tractable likelihood (distilled discrete diffusion, in our case), generative PPL is easy to game by sampling at low entropy. You get "better" PPL just by being more repetitive.
GM uses the gradient of a reference LM's NLL instead.
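To give a feel for the shape of the computation, here's a minimal PyTorch sketch. To be clear: this is not the repo's API. I'm assuming GM means something like the second moment of the reference LM's parameter-space NLL gradient, averaged over generated samples; the GPT-2 reference, function names, and normalization here are all stand-ins, so check the paper/repo for the real definition.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Stand-in reference LM; the paper's choice may differ.
ref_name = "gpt2"
tok = AutoTokenizer.from_pretrained(ref_name)
ref = AutoModelForCausalLM.from_pretrained(ref_name)
ref.eval()

def gradient_moment(samples: list[str]) -> float:
    """Hypothetical GM: mean squared norm of the reference LM's
    NLL gradient (w.r.t. its parameters) over generated samples."""
    moments = []
    for text in samples:
        ids = tok(text, return_tensors="pt").input_ids
        ref.zero_grad(set_to_none=True)
        # NLL of the sample under the reference LM
        # (HF causal LMs shift labels internally).
        nll = ref(ids, labels=ids).loss
        nll.backward()
        sq_norm = sum(
            (p.grad ** 2).sum() for p in ref.parameters() if p.grad is not None
        )
        moments.append(sq_norm.item())
    return sum(moments) / len(moments)

# Repetitive text can score a deceptively low NLL (the PPL-gaming
# failure mode); the idea is that the gradient signal is harder
# to shrink the same way.
print(gradient_moment([
    "The cat sat on the mat.",
    "the the the the the the the",
]))
```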