Can we evaluate if LLMs can reason structurally?
I discussed this question through the lens of data structures in a talk at @SimonsInstitute. In a joint ICML'26 work with @yuhe441, Yingxi Li, and @crwhite_ml, we introduce DSR-Bench, the data structure reasoning benchmark. (1/9)