/AI8h ago

Thomas Serre Delivers Keynote on Scaling Laws vs Neural Laws at CVPR

13374143.9K
Original post
Andrei Bursuc @CVPR@abursuc#1577inAI

.@tserre starts his keynote on Scaling Laws vs. Neural laws: towards more natural artificial vision #cvpr2026

9:39 AM · Jun 7, 2026 · 1.5K Views
Sentiment

Users express enthusiasm for throwbacks to historical computer vision papers on hierarchical models highlighted during Thomas Serre's CVPR keynote contrasting scaling laws and neural laws.

Pos
100.0%
Neg
0.0%
1 comments with sentiment.
Cluster Engagement
Posts from X
Most Activity
Most Activity
VIEWS625BOOKMARKS5LIKES8RETWEETS3

Last keynote: "Scaling Laws vs. Neural Laws: Toward More Natural Artificial Vision" kicking off now with @tserre

7hViews 625Likes 8Bookmarks 5
REPLIES1

Quick demo to assess the gist capabilities of recognizing animal vs non-animal from a few ms of image display. Who remembers the GIST descriptors from the 2000s? #cvpr2026

.@tserre starts his keynote on Scaling Laws vs. Neural laws: towards more natural artificial vision #cvpr2026

8hViews 568Likes 4Bookmarks 0

On learning simple long range dependencies in images, small ResNets work, but ViTs struggle #cvpr2026

One of the things that we can look at is improving the feedback mechanism in our architectures to mimic recurrent dynamics and circuit models from the human brain. Let’s make our models recurrent again? #cvpr2026

7hViews 149Likes 3Bookmarks 2

Loving this throwbacks of papers from the past of computer vision: hierarchical vision models #cvpr2026

Quick demo to assess the gist capabilities of recognizing animal vs non-animal from a few ms of image display. Who remembers the GIST descriptors from the 2000s? #cvpr2026

8hViews 367Likes 2Bookmarks 0

The AlexNet moment was not pivotal only to computer vision community, but also to neuroscience researchers. Better AI initially meant better neuroscience models #cvpr2026

Loving this throwbacks of papers from the past of computer vision: hierarchical vision models #cvpr2026

8hViews 96Likes 0Bookmarks 0

Taking a look on how babies interact with new objects and where is their gaze at #cvpr2026

On some other metrics for behavioral alignment on areas of focus in the image, they observe the same trend #cvpr2026

8hViews 87Likes 2Bookmarks 0

On some other metrics for behavioral alignment on areas of focus in the image, they observe the same trend #cvpr2026

However, in the second wave of models(big CNNs, ViTs) and the multi-labeling of ImageNet: the correlation between ImageNet accuracy and Neural Alignment does not hold. Similar story for SSL models, thouggh DINOv3 is better than JEPAs #cvpr2026

8hViews 81Likes 2Bookmarks 0

starting with the tl;dr: as we in computer vision are both benchmaxxing and (successfully) improving real performance, we are making our vision systems "more artificial" (less human like).

this talk will advocate ways to make our artificial vision systems less artificial

7hViews 110Likes 1

The same trend is observed for their proposed stability score #cvpr2026

They study on Co3D which learning objective for SSL lead to better human feature importance alignement. CNNs are in general better. On training objectives, autoregressive one boost alignment significantly #cvpr2026

8hViews 102Likes 1Bookmarks 0

One of the things that we can look at is improving the feedback mechanism in our architectures to mimic recurrent dynamics and circuit models from the human brain. Let’s make our models recurrent again? #cvpr2026

The same trend is observed for their proposed stability score #cvpr2026

8hViews 96Likes 1Bookmarks 0

analysis been repeated in 2026 using many models from timm (h/t @wightmanr).

neural alignment is computed between primates and the models.

we see that CNNs (red dots) and especially w/ adversarial training (yellow dots) show positive correlation between perf and alignment

7hViews 18Likes 2

However, in the second wave of models(big CNNs, ViTs) and the multi-labeling of ImageNet: the correlation between ImageNet accuracy and Neural Alignment does not hold. Similar story for SSL models, thouggh DINOv3 is better than JEPAs #cvpr2026

The AlexNet moment was not pivotal only to computer vision community, but also to neuroscience researchers. Better AI initially meant better neuroscience models #cvpr2026

8hViews 138Likes 0Bookmarks 0

They study on Co3D which learning objective for SSL lead to better human feature importance alignement. CNNs are in general better. On training objectives, autoregressive one boost alignment significantly #cvpr2026

Taking a look on how babies interact with new objects and where is their gaze at #cvpr2026

8hViews 87Likes 0Bookmarks 0

start with a demo: rapidly flashing images with either animal or non-animal images very quickly and showing that we can all identify them

7hViews 79

visual importance data also collected from humans by crowd sourcing data from volunteers.

we see (via gradient based saliencey) that modern models attend quite differently than humans (more diffuse, less interpretable).

human <-> model results mirror primate results closely

7hViews 18Likes 1

Enter State Space Models (gated delta net) which allow parallelizing over the sequence and preserving recurrence.

SSM / GDN yields a new pareto frontier for these models

7hViews 46

one might think this only applies to CNNs (because architecturally it is true that CNNs grow receptive field with depth), but even models with global attention at each layer (ViT) empirically fail at the pathfinder task

7hViews 35

interestingly - it became clear that abandoning biological inspiration not only improved absolute performance, but also improve correlation with observed activations in primate visual systems (at least at first).

this eval was performed in 2018

7hViews 23

of course AlexNet changed everything. Not only in CV, but also in neuro-bio for vision

7hViews 22

some history: pre-AlexNet there were some biologically inspired hierarchical vision models

7hViews 22
Load more posts