/Tech37d ago

Jiaxin Wen positions vintage language models such as Talkie as baselines for testing pre-training and post-training technique interactions rather than rediscovering results like relativity

Alexander Doria prefers synthetic pretraining for controlled model behavior experiments.

630031.9K

#1537

Original post

Jiaxin Wen@jiaxinwen22#1566inTech

What's the most valuable thing you can do with vintage LMs like Talkie? I think people are misled by Demis's pitch about rediscovering Relativity. Vintage LMs are just great baselines for LM science, letting you test many hypotheses about how pre-training and post-training interact.

8:36 AM · May 23, 2026 · 1.6K Views

Sentiment

Positive users like vintage language models for enabling controlled experiments on AI rediscovering science concepts, while negative users call some models too small and undertrained to be useful.

Pos

66.7%

Neg

33.3%

3 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

VIEWS299LIKES7REPLIES1

Alexander Doria@Dorialexander

@jiaxinwen22 I like vintage models a lot (likely trained the first one ever) but synthetic pretraining in general is a better frame for controlled experiments.

Jiaxin Wen@jiaxinwen22

37d29970

Aniketh@aniketthh

@jiaxinwen22 talkie is def too small and undertrained to do anything for meaningful related to ‘rediscovering science’

it doesn’t even recall/understand relativity

37d48

Jiaxin Wen@jiaxinwen22

@aniketthh I'm not saying this because of today's performance of Talkie, but just in general vintage LMs -- even future really capable ones

37d26

max!@maxsloef

@jiaxinwen22 what’s a hypothesis you’d like to test with one?

37d10

Latent Node@latent_node

@jiaxinwen22 We did something similar here -

37d6

Xinpeng Wang@XinpengWang_

@jiaxinwen22 what do you mean by 'misled'? If we have the compute and all the data (not only text) until Einstein's time, rediscovering Relativity is a good way to test if our current training recipe can lead to real insightful AI.

37d2