New blog post: The Forgetting Wall in Video and World Models
Long-horizon video generation is not limited by compute alone. It is also limited by how much of its own past the model can afford to remember.
I wrote about why long videos drift, why the KV cache becomes the memory bottleneck, and why compression is a key direction for future video and world models.
https://haochengxi.github.io/posts/forgetting-wall/

