/Tech2h ago

Sami Jaghouar of Prime Intellect questions whether decreasing reward signals behave like loss signals in degrading automated training loops

Automated feedback loops show degrading performance across successive iterations.

122451085.2K

#832

Original post

samsja@samsja19#1316inTech

@willccbb @yacinelearning @karpathy reward going down is like loss going down no ?

will brown@willccbb

@yacinelearning @karpathy autoresearch but the training perf gets worse every iteration

2:53 PM · Jun 9, 2026 · 239 Views

/Tech2h ago

Sami Jaghouar of Prime Intellect questions whether decreasing reward signals behave like loss signals in degrading automated training loops

Automated feedback loops show degrading performance across successive iterations.

122451085.2K

#832

Original post

samsja@samsja19#1316inTech

@willccbb @yacinelearning @karpathy reward going down is like loss going down no ?

will brown@willccbb

@yacinelearning @karpathy autoresearch but the training perf gets worse every iteration

2:53 PM · Jun 9, 2026 · 239 Views

Sentiment

Users find the prospect of AI autoresearch loops showing degrading training performance each iteration hilarious and amusing.

Pos

100.0%

Neg

0.0%

1 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

am.will@LLMJunky

@willccbb @yacinelearning @karpathy 🤣

3h4362

LIKES9

Noah Ziems@NoahZiems

@willccbb @yacinelearning @karpathy This is exactly what hiring phd interns are for. They've automated my job away

2h1869

RETWEETS10

will brown@willccbb

@yacinelearning @karpathy autoresearch but the training perf gets worse every iteration

3h5.3K2388

q@gradientdespair

@willccbb @yacinelearning @karpathy AutoDegradation by Anthropic. Thanks @karpathy

3h1253

Golden Hippie@gamestoneai

@willccbb @yacinelearning @karpathy AI impostor syndrome. Model sandbags its own training runs.

2h120

Venkat — inference & RL/acc@venkat_systems

@willccbb @yacinelearning @karpathy

2h95

Trash Panda 🦝@trashpandaemoji

@willccbb @max_paperclips @yacinelearning @karpathy I would just simply do the opposite.

2h57

arXiv Bangers@arXivBangers

@willccbb @yacinelearning @karpathy B-Banger…

1h141

chris@candyflipline

@willccbb @yacinelearning @karpathy wait that would be hilarious

2h39

vincenzo@alargemike

@willccbb @max_paperclips @yacinelearning @karpathy that’s right

1h9