was reading a paper last night that felt very timely and refreshing, like the sort of thing that was bound to anchor the next wave of innovation in self-teaching
then i realized it was from 2022
AI Judge changed title after evaluation, original title: "AI researchers revisit Minqi Jiang 2022 exploration paper"
Researchers on X note the paper's fresh arguments and relevance to self-teaching systems and current AI agent development.
was reading a paper last night that felt very timely and refreshing, like the sort of thing that was bound to anchor the next wave of innovation in self-teaching
then i realized it was from 2022
Positive users praise the 2022 self-teaching paper and rethinking-exploration arguments as still timely and ahead of their time for AI innovation, while a few negative users dismiss the post-2022 LLM period as cursed fever.
Nice of @jennyzhangzt to share this paper, which I selfishly think was ahead of its time. The context was that I was leaving Meta to do another startup, and thought I would not be writing papers for years. Of course, @MinqiJiang had all the good ideas + did most of the writing 😅
general Intelligence requires rethinking exploration
https://arxiv.org/abs/2211.07819
time to rethink exploration
https://arxiv.org/abs/2211.07819
was reading a paper last night that felt very timely and refreshing, like the sort of thing that was bound to anchor the next wave of innovation in self-teaching
then i realized it was from 2022

@willccbb I think as long as the model can't learn the concept of time, there will be no real exploration ( which BTW is why agent frontend sw thrives now, it's basically a conduit for LLM to borrow clock from the CPU). And since time points to entropy increase, all ce loss min won't work
was reading a paper last night that felt very timely and refreshing, like the sort of thing that was bound to anchor the next wave of innovation in self-teaching
then i realized it was from 2022
@willccbb it's time to explore
was reading a paper last night that felt very timely and refreshing, like the sort of thing that was bound to anchor the next wave of innovation in self-teaching
then i realized it was from 2022

@willccbb Cathedrals everywhere

@willccbb nice find. Adding Whiteboard summary (ChatGPT image gen)

@willccbb How is this different from Unsipervised Environment design? In RL there are already lots of papers and i assume some folks already managed to apply ideas from UED. Seems like this paper is just the same findings albeit from other perspective
@egrefen @jennyzhangzt Thanks for the kind words (and for originally instigating this paper)!
Talking through and rethinking these ideas together over the ~6 months spent writing this was the most transformative part of my PhD, when I figured out to a great extent what I believed as a researcher.
Nice of @jennyzhangzt to share this paper, which I selfishly think was ahead of its time. The context was that I was leaving Meta to do another startup, and thought I would not be writing papers for years. Of course, @MinqiJiang had all the good ideas + did most of the writing 😅
many such cases
was reading a paper last night that felt very timely and refreshing, like the sort of thing that was bound to anchor the next wave of innovation in self-teaching
then i realized it was from 2022

@willccbb Ref?

@egrefen @MinqiJiang definitely ahead of its time

@willccbb wandering vs exploring

@willccbb mmmm

@willccbb AI years are so cursed that 2022 already feels like ancient literature
@MinqiJiang @willccbb And rethink it.
@willccbb it's time to explore

@brandonsdinunno

@willccbb Was all llm fever after that

@willccbb Ah the open endedness bait.

@maxsbob21 @willccbb @MinqiJiang (the first author) is a pioneer of UED (was also the first author on PLR, for example)