Proposed Shannon Scaling Law models LLM training as a noisy channel to predict U-shaped loss curves

It outperformed the Chinchilla and OpenAI scaling laws in extrapolation tests.

Original post

Very typical Seed paper, but what is interesting is that they get a very well-generalizing scaling law.

12:50 AM · May 26, 2026