14h ago

Proposed Shannon Scaling Law models LLM training as a noisy channel to predict U-shaped loss curves

It outperformed Chinchilla and OpenAI laws in extrapolation tests.

2341236.4K

——0——

Original post

Very typical Seed paper but what is interesting is that they get a very well generalizing scaling law

QUOTE POST

I have many things to say on this paper, but I need to read it better 🧐

5:24 PM · May 26, 2026 · 2.6K Views