/Tech4h ago

Neil Chowdhury corrects Benjamin Anderson on scaling laws, showing test loss scales as a power law of compute rather than logarithmically

Chowdhury plotted compute in petaflop-days against test loss.

51000586

Original post

Ben (no treats)@andersonbcdefg#1052inTech

to be fair: pretraining loss scales logarithmically with training compute. capabilities aren't linear in pretraining loss, though. each marginal reduction in loss gets harder and potentially unlocks more capabilities. dario's claim is still insane though

Josh@JoshPurtell

Good call out. *Everyone* knows capabilities scale logarithmically with computing power.

This is the second paragraph. How on earth does a mistake like this make it in?

10:57 PM · Jun 10, 2026 · 326 Views

Sentiment

Users thanked the critic for flagging the scaling laws error in Dario Amodei's essay.

Pos

100.0%

Neg

0.0%

1 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

VIEWS240LIKES2

Ben (no treats)@andersonbcdefg

thanks @cis_female

Ben (no treats)@andersonbcdefg

to be fair: pretraining loss scales logarithmically with training compute. capabilities aren't linear in pretraining loss, though. each marginal reduction in loss gets harder and potentially unlocks more capabilities. dario's claim is still insane though

4h24020

REPLIES2

Neil Chowdhury@ChowdhuryNeil

@andersonbcdefg it's a power law, not logarithmic

Ben (no treats)@andersonbcdefg

to be fair: pretraining loss scales logarithmically with training compute. capabilities aren't linear in pretraining loss, though. each marginal reduction in loss gets harder and potentially unlocks more capabilities. dario's claim is still insane though

4h2020

Ben (no treats)@andersonbcdefg

@aspergtame allow me to speak with you about phylogenomics

4h61