18h ago

Jasper Gilley shows neural networks cycle through learning and forgetting, with correct solutions acting as unstable saddle points

His methods can force grokked models to forget.


Original post

#282@DHRUVBATRA_OP

Jasper Gilley @0xjasper

Here's the video of my talk @southpkcommons Demo Day! Featuring all new visualizations for why grokking works, how you can make grokked models forget, and what this says about memorization in LLMs

10:37 AM · May 26, 2026

QUOTE POST

#1142 Abhishek Das @ABHSHKDZ

Go @0xjasper!

Jasper Gilley@0xjasper

Here's the video of my talk @southpkcommons Demo Day! Featuring all new visualizations for why grokking works, how you can make grokked models forget, and what this says about memorization in LLMs

5:37 PM · May 26, 2026 · 2.3K Views

2:06 AM · May 27, 2026 · 717 Views

