@andrewgwils I'm surprised at the effectiveness of RLVR for mathematics (if that is what they are using). I would love to see a detailed analysis of what knowledge is required at each stage of these long proofs. My assumption has been that LLMs can learn to instantiate general rules ...
Keep in mind that the current skeptics would have overwhelmingly said three years ago that many of the capabilities we are seeing now (e.g. solving important open problems in mathematics) would not have been achieved through these approaches.