Correspondence, correspondence, correspondence!
Talk Argues Correspondence Drives Representation Learning More Than ImageNet Accuracy
Most Activity
And it's not just a curiosity.
Across 37 vision backbones, correspondence quality correlates more strongly with downstream tasks — segmentation, 3D pose, tracking, depth — than ImageNet kNN does.
The capability driving everything downstream is the one we measure least carefully
