Richard Sutton, University of Alberta professor, posts 26-word distillation of bitter lesson in AI
Richard Sutton posted a 26-word summary of the bitter lesson urging researchers to favor scalable search and learning methods over embedded human knowledge. Aviv Tamar quote-tweeted the post and called the source material probably the best position paper in machine learning. The exchange focused on how computation-driven techniques have historically outperformed approaches that rely on domain-specific human expertise.
@agarwl_ discarding humand data (be it in LLM or in non-trivial "classic" RL env) is just complete nonsense. Even if Rich says it. But I'm not sure he's actually saying this here.
Guess he should use more than 26 words, but then it wouldn't sound Ilya-style mysterious anymore :)
Perplexed by this take: Sure, let's not mainly do supervise learning on human knowledge, but it makes sense to build off it instead of the *let's do it from scratch*. People cite AlphaGo vs AlphaGo Zero as a quintessential example of how using human-generating data is suboptimal but it was *imitating* it that was suboptimal. What if we learned from that data assuming it was suboptimal in the first place (so not supervised learning but RL like mindset of using that data)
@RichardSSutton I think it is worth studying human knowledge, particularly understanding the structure of its abstractions, as they can provide guidance about the kinds of things humans learn and machines do not (yet). I wouldn't call that "distraction".
The bitter lesson in 26 words: Don’t be distracted by human knowledge, as AI has been historically. Instead focus on methods for creating knowledge that scale with computation, like search and learning.
The bitter lesson in 26 words:
Don’t be distracted by human knowledge, as AI has been historically. Instead focus on methods for creating knowledge that scale with computation, like search and learning.
Also abstractions & knowledge curation.
The bitter lesson in 26 words: Don’t be distracted by human knowledge, as AI has been historically. Instead focus on methods for creating knowledge that scale with computation, like search and learning.
i wonder whether this needs an update.
current methods, such as they are, leverage massive amounts of human knowledge as their primary fuel. they would be lost without it.
and they even build some knowledge into their system prompts.
and lately they build knowledge into their harnesses, usually by over 50 tools that have been carefully crafted with human knowledge.
The bitter lesson in 26 words: Don’t be distracted by human knowledge, as AI has been historically. Instead focus on methods for creating knowledge that scale with computation, like search and learning.
Perplexed by this take: Sure, let's not mainly do supervise learning on human knowledge, but it makes sense to build off it instead of the *let's do it from scratch*.
People cite AlphaGo vs AlphaGo Zero as a quintessential example of how using human-generating data is suboptimal but it was *imitating* it that was suboptimal.
What if we learned from that data assuming it was suboptimal in the first place (so not supervised learning but RL like mindset of using that data)
The bitter lesson in 26 words: Don’t be distracted by human knowledge, as AI has been historically. Instead focus on methods for creating knowledge that scale with computation, like search and learning.
bitter and sad
The bitter lesson in 26 words: Don’t be distracted by human knowledge, as AI has been historically. Instead focus on methods for creating knowledge that scale with computation, like search and learning.
GOD GAVE US THE UNIVERSE, THE ORACLE. WE MUST MINE IT
The bitter lesson in 26 words: Don’t be distracted by human knowledge, as AI has been historically. Instead focus on methods for creating knowledge that scale with computation, like search and learning.
Natural language is human-created representation of the world.
Is the ultimate form of the bitter lesson to bypass natural language entirely and learn a new representation from the world itself?
The bitter lesson in 26 words: Don’t be distracted by human knowledge, as AI has been historically. Instead focus on methods for creating knowledge that scale with computation, like search and learning.
@jiaxinwen22 how do you explain point 3 given that debate is empirically hard, LLM judge systematically fails to make reliable more subjective judgements (which is basically what taste is), etc? why doesn't this just result in the slop problems we see now when people try and scale SWE TTC?
I might be one of the few people who is most bearish on human research taste and bullish on automated research: - "AIs can only do hyperparameter search" is mainly a skill issue with bad automated research setups. - human taste is overrated, e.g. frontier labs / neolabs are doing pretty simlar things. - human taste might win in a low-compute world, but not a high-compute world we're entering.
@jiaxinwen22 median human taste is not great. the tails are pretty good, but for the purposes of "will AI replace a profession" the media is not hard to beat.
I might be one of the few people who is most bearish on human research taste and bullish on automated research: - "AIs can only do hyperparameter search" is mainly a skill issue with bad automated research setups. - human taste is overrated, e.g. frontier labs / neolabs are doing pretty simlar things. - human taste might win in a low-compute world, but not a high-compute world we're entering.
However, I hope humans keep doing our own research, with *strong* tastes and priors. Not all research is about outcomes. Sometimes you just want to solve/understand a problem in a way that feels like yours.
I might be one of the few people who is most bearish on human research taste and bullish on automated research: - "AIs can only do hyperparameter search" is mainly a skill issue with bad automated research setups. - human taste is overrated, e.g. frontier labs / neolabs are doing pretty simlar things. - human taste might win in a low-compute world, but not a high-compute world we're entering.
I might be one of the few people who is most bearish on human research taste and bullish on automated research: - "AIs can only do hyperparameter search" is mainly a skill issue with bad automated research setups. - human taste is overrated, e.g. frontier labs / neolabs are doing pretty simlar things. - human taste might win in a low-compute world, but not a high-compute world we're entering.
The bitter lesson in 26 words: Don’t be distracted by human knowledge, as AI has been historically. Instead focus on methods for creating knowledge that scale with computation, like search and learning.
@menhguin If the job of top human experts is just to write a few high-level directions and ask AIs to solve, I’d still call it automated research.
@jiaxinwen22 median human taste is not great. the tails are pretty good, but for the purposes of "will AI replace a profession" the media is not hard to beat.
Probably the best position paper in ML, now fits in a tweet
The bitter lesson in 26 words: Don’t be distracted by human knowledge, as AI has been historically. Instead focus on methods for creating knowledge that scale with computation, like search and learning.
@agarwl_ the key issue seems to be “we’re operating inside of a bad box, here’s the good box” people like to agree on the second part, but few internalize why “human knowledge” is so attractive and why “how humans learn” is another humanization
Perplexed by this take: Sure, let's not mainly do supervise learning on human knowledge, but it makes sense to build off it instead of the *let's do it from scratch*. People cite AlphaGo vs AlphaGo Zero as a quintessential example of how using human-generating data is suboptimal but it was *imitating* it that was suboptimal. What if we learned from that data assuming it was suboptimal in the first place (so not supervised learning but RL like mindset of using that data)
@RichardSSutton The center of mass of AI history is like six month ago, so I'd say it was mostly about LLMs and learning from humans
The bitter lesson in 26 words: Don’t be distracted by human knowledge, as AI has been historically. Instead focus on methods for creating knowledge that scale with computation, like search and learning.