Greg Kamradt says the ARC-AGI-3 benchmark tests schema expansion by introducing cumulative new mechanics at each level · Digg