/Tech4h ago

Neil Chowdhury corrects Benjamin Anderson on scaling laws, showing test loss scales as a power law of compute rather than logarithmically

Chowdhury plotted compute in petaflop-days against test loss.

51000586
Original post
Ben (no treats)@andersonbcdefg#1052inTech

to be fair: pretraining loss scales logarithmically with training compute. capabilities aren't linear in pretraining loss, though. each marginal reduction in loss gets harder and potentially unlocks more capabilities. dario's claim is still insane though

Josh@JoshPurtell

Good call out. *Everyone* knows capabilities scale logarithmically with computing power.

This is the second paragraph. How on earth does a mistake like this make it in?

10:57 PM · Jun 10, 2026 · 326 Views
Sentiment

Users thanked the critic for flagging the scaling laws error in Dario Amodei's essay.

Pos
100.0%
Neg
0.0%
1 comments with sentiment.
Cluster Engagement
Posts from X
Most Activity
Most Activity
VIEWS240LIKES2
Ben (no treats)@andersonbcdefg

thanks @cis_female

Ben (no treats)@andersonbcdefg

to be fair: pretraining loss scales logarithmically with training compute. capabilities aren't linear in pretraining loss, though. each marginal reduction in loss gets harder and potentially unlocks more capabilities. dario's claim is still insane though

4hViews 240Likes 2Bookmarks 0
REPLIES2
Neil Chowdhury@ChowdhuryNeil

@andersonbcdefg it's a power law, not logarithmic

Ben (no treats)@andersonbcdefg

to be fair: pretraining loss scales logarithmically with training compute. capabilities aren't linear in pretraining loss, though. each marginal reduction in loss gets harder and potentially unlocks more capabilities. dario's claim is still insane though

4hViews 20Likes 2Bookmarks 0
Ben (no treats)@andersonbcdefg

@aspergtame allow me to speak with you about phylogenomics

4hViews 6Likes 1