/Tech · 3h ago

Teortaxes proposes automating AI research with models running dynamic grid search to optimize compute cluster utilization

Creator bayeslord endorsed the shift toward automated empirical scaling.

563196.5K

#214

Original post

bayes @bayeslord · #693 in Tech

@tszzl 100%

roon@tszzl

the way humans do ai research is highly empirical, but it is possible there are very outsized theoretical and mechanistic improvements in model training. even the gradations of skill among human researchers mean some create 10,000x more progress given a compute budget. some invent the transformer or PPO

this is the ilya sutskever “age of research” bet, that you can find massive improvements on small models and small training runs. if Ilya thinks so maybe GPT7 and Claude Requiem think so too.

many of the brightest researchers don’t do fundamental deep learning research anymore. most have stopped being curious as to what a neural net is the way they used to be in 2017. probably because incremental engineering-based progress has been so guaranteed and low-hanging.

the rate and cost of progress today doesn’t necessarily predict the speed of RSI loop

9:28 PM · Jun 9, 2026 · 652 Views

Sentiment

Users sarcastically dismiss automated AI research removing human bias as misguided, mocking impractical steps like kernel optimization experiments before scaling checks.

Pos

0.0%

Neg

100.0%

1 comment with sentiment.

Cluster Engagement

Posts from X

Most Activity

Views 4.6K · Bookmarks 9 · Likes 36 · Retweets 1 · Replies 3

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

The cynical thesis for automated research is that if you remove the human researcher, you can utilize the cluster better. Fable/GPT-5.6 can be trained to maximize information gain with grid search. Smaller experiments, more dynamic, without perf confounders. Better Than Taste.

2h · 4.6K · 36 · 9

Rohan Pandey@khoomeik

@tszzl i want to believe

very hard to forecast (and credit-assign) these kinds of breakthroughs

and even if models can conjecture 100s of plausible breakthroughs, they may still require significant compute to derisk at scale

2h658140

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

*Claude devote 5% of the budget to optimizing kernels for every arch experiment before checking how it scales. make no mistakes* This is how you go from "oh noes DeepSeek is OOMs cheaper, necessity is the mother of invention" peasant morality fable to alien tech in months

2h1.2K81

kache@yacineMTB

@teortaxesTex Unironically though

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

2h75360

Rafa Schwinger 🇻🇦@Rafa_Schwinger

@teortaxesTex I think most models get stuck in an autoresearch local optimum if the loop is naive or there isn't a human. LLM's seem to be too sequential to intuit some sorts of common sense.

They often get stuck in the tactical and don't backtrack to the strategic.

2h631