
A 2022 arXiv paper titled 'General Intelligence Requires Rethinking Exploration' calls for shifting from learning from data to learning what data to learn from, and for new methods beyond reinforcement learning.

AI Judge changed the title after evaluation. Original title: "AI researchers revisit Minqi Jiang 2022 exploration paper"

Researchers on X note the paper's fresh arguments and relevance to self-teaching systems and current AI agent development.


Original post

will brown @willccbb

was reading a paper last night that felt very timely and refreshing, like the sort of thing that was bound to anchor the next wave of innovation in self-teaching

then i realized it was from 2022

10:06 AM · May 16, 2026 · 33.4K Views

Sentiment

Positive users praise the 2022 self-teaching paper and its rethinking-exploration arguments as still timely and ahead of their time for AI innovation, while a few negative users dismiss the post-2022 period as "cursed" or as "LLM fever."

Positive: 81.2%

Negative: 18.8%

12 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

Views 14.7K · Replies 7

Edward Grefenstette @egrefen

Nice of @jennyzhangzt to share this paper, which I selfishly think was ahead of its time. The context was that I was leaving Meta to do another startup, and thought I would not be writing papers for years. Of course, @MinqiJiang had all the good ideas + did most of the writing 😅

Jenny Zhang @jennyzhangzt

general Intelligence requires rethinking exploration

https://arxiv.org/abs/2211.07819

Bookmarks 162 · Likes 140 · Retweets 13

will brown @willccbb

time to rethink exploration

https://arxiv.org/abs/2211.07819


Zhipeng Huang @nopainkiller

@willccbb I think as long as the model can't learn the concept of time, there will be no real exploration (which BTW is why agent frontend sw thrives now, it's basically a conduit for LLM to borrow clock from the CPU). And since time points to entropy increase, all ce loss min won't work

Minqi Jiang @MinqiJiang

@willccbb it's time to explore

0.005 Seconds (3/694) @seconds_0

@willccbb Cathedrals everywhere

Bryan Craven @bryancrav

@willccbb nice find. Adding Whiteboard summary (ChatGPT image gen)

Maxim Bobrin @maxsbob21

@willccbb How is this different from Unsupervised Environment Design? In RL there are already lots of papers and I assume some folks already managed to apply ideas from UED. Seems like this paper is just the same findings, albeit from another perspective

Minqi Jiang @MinqiJiang

@egrefen @jennyzhangzt Thanks for the kind words (and for originally instigating this paper)!

Talking through and rethinking these ideas together over the ~6 months spent writing this was the most transformative part of my PhD, when I figured out to a great extent what I believed as a researcher.

Andrew Drozdov @mrdrozdov

many such cases

will brown @willccbb

was reading a paper last night that felt very timely and refreshing, like the sort of thing that was bound to anchor the next wave of innovation in self-teaching

then i realized it was from 2022

compliantBD @brandonsdinunno

@willccbb Ref?

Jenny Zhang @jennyzhangzt

@egrefen @MinqiJiang definitely ahead of its time

shriya @shriyalola

@willccbb wandering vs exploring

Tim Kostolansky @thkostolansky

@willccbb mmmm

Anubhav @Anubhavhing

@willccbb AI years are so cursed that 2022 already feels like ancient literature

Edward Grefenstette @egrefen

@MinqiJiang @willccbb And rethink it.

Minqi Jiang @MinqiJiang

@willccbb it's time to explore

will brown @willccbb

@brandonsdinunno

Sidharth Giri @helmholtzigga

@willccbb Was all llm fever after that

Theodore Galanos @TheodoreGalanos

@willccbb Ah the open endedness bait.

nathan monette @nathanrmonette

@maxsbob21 @willccbb @MinqiJiang (the first author) is a pioneer of UED (was also the first author on PLR, for example)
