Proposed Shannon Scaling Law models LLM training as a noisy channel to predict U-shaped loss curves
It outperformed Chinchilla and OpenAI laws in extrapolation tests.
——0——
QUOTE POST
#612Ravid Shwartz Ziv@ZIV_RAVID
I have many things to say on this paper, but I need to read it better 🧐
5:24 PM · May 26, 2026 · 2.6K Views