/Tech1h ago

AI Enables New Methods to Study Intelligence and Cognition

4511398

Original post

Leshem (Legend) Choshen 🤖🤗 @ACL @ICML@LChoshen#985inTech

Livetweet @mcxfrank talk: If we care about intelligence and cognition, AI has recently allowed a change; we now have two ways to study them. AI is of course allowing us a lot that we wouldn't dare on our Children (brain surgery, never tell a child about cats...) #acl #conll

1:53 PM · Jul 3, 2026 · 283 Views

Sentiment

Users are excited about AI enabling new methods to study intelligence and cognition because it supports amazing participatory data collection efforts like Wordbank.

Pos

100.0%

Neg

0.0%

1 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Related links

Updated Rules for BabyLM Round 4

BABYLM.GITHUB.IOVia

#985

Posts from X

Most Activity

VIEWS46REPLIES1

Leshem (Legend) Choshen 🤖🤗 @ACL @ICML@LChoshen

This also raises the differences between the two (hypotheses in pic). Why do models need so much more data for a similar result? Why can humans learn much more efficiently (better is debatable)? (Known as the @babyLMchallenge c.f. https://babylm.github.io/ if you're interested)

Leshem (Legend) Choshen 🤖🤗 @ACL @ICML@LChoshen

1h4600

LIKES1

Leshem (Legend) Choshen 🤖🤗 @ACL @ICML@LChoshen

@mcxfrank (amazing) works also create huge participatory data collection efforts. Don't miss them. Wordbank gives you the words children around the world know (at 16-30mo). https://wordbank.stanford.edu/

1h191

Leshem (Legend) Choshen 🤖🤗 @ACL @ICML@LChoshen

Maybe child data is simply more useful? They checked on tinydialogues, childes and babyLM, for LLMs the data children hear might (?) be useful for children, but it is worse for LLMs. Frank mentions it is less diverse, I'll also add less challenging, motivation is a human issue.

Leshem (Legend) Choshen 🤖🤗 @ACL @ICML@LChoshen

1h4100

Leshem (Legend) Choshen 🤖🤗 @ACL @ICML@LChoshen

So languages do not interfere. Well sure, but don't you expect to improve, knowledge/skills and anything nonlinguistic? The knowledge you learn helps across the languages. This questions bothers me lately, as you probably noted in my papers, social etc.

1h27

Leshem (Legend) Choshen 🤖🤗 @ACL @ICML@LChoshen

I wonder if some children accelerate faster, are there connections between starting time and acceleration. Do non noise outliers exist? We think of teaching more, but can we teach "learning" even at the cost of a slower start?(p.s. this is on words, but we've got to start somwe..

1h18

Leshem (Legend) Choshen 🤖🤗 @ACL @ICML@LChoshen

Sadly I missed part of the bilingual (I am that dedicated to sharing his talk here) but iiuc they show that learning also on another language doesn't reduce the ability to learn language.

1h16

Leshem (Legend) Choshen 🤖🤗 @ACL @ICML@LChoshen

Maybe children get their language in the right order? Well maybe, but for models it's worse than random (which as Michael said, by now there's compounding evidence, curriculum is hard to make any change, see his work, babyLM findings and the award winning negative results).

1h13

Leshem (Legend) Choshen 🤖🤗 @ACL @ICML@LChoshen

The results from that easily give you worldwide comparisons, yes girls learn more worlds early on, children learning accelerates etc.

1h10

Leshem (Legend) Choshen 🤖🤗 @ACL @ICML@LChoshen

A similar issue is in multimodal, adding a lot of images doesn't improve your "intelligence"(pic). You need very sophisticated training (that they create) to force the images to improve anything for the text.

1h10

Leshem (Legend) Choshen 🤖🤗 @ACL @ICML@LChoshen

Training on a large amount of data (not only children) sounds promising, but apparently, the mismatch between what you see and what is said is too great.

1h10

Leshem (Legend) Choshen 🤖🤗 @ACL @ICML@LChoshen

Databrary is the effort of recording view and sounds of babies. Collected by many so far they will soon reach the amount of data of a whole (wake-time) year! https://databrary.org/

1h17