/Tech10h ago

Ross Taylor, General Reasoning co-founder, outlines long-horizon RL tradeoffs and infrastructure challenges expected by 2026

The talk details upcoming shifts in actor-critic algorithms.

0120119

#225

Original post

Ross Taylor@rosstaylor90#842inTech

Speaking at @aiDotEngineer today about long-horizon RL and the different tradeoffs involved. 2:25pm.

Also here in SF for a week if anyone wants to catchup 👋

Ross Taylor@rosstaylor90

Excited to be speaking at @aiDotEngineer World’s Fair alongside partner in crime @ChengxiTaylor.

It seems we have a lot to talk about:

🤼‍♂️ Actor-critic is hot again? How are current RL algorithms changing with long-horizon rollouts. 🌍 What does a good long-horizon environment look like? How do they differ from terminal based tasks? 🤖 What does good RL infra look like in 2026? Why is long horizon breaking things?

As a bonus, I’ll give a run down of our own RL journey starting from Galactica (2021/2022), early SoTA reasoning efforts at Meta (2023), and more.

Super fun, high signal-to-noise guaranteed. Come join us!

6:51 AM · Jun 30, 2026 · 119 Views

Sentiment

Sentiment building, check back later.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

VIEWS1.4KBOOKMARKS2LIKES5

Ross Taylor@rosstaylor90

Room 2016 for those attending @aiDotEngineer 2:25pm.

Will also cover Galactica, early Llama reasoning efforts and more - think this is the first time I’ve ever covered this in a public talk 👀.

@swyx

3h1.4K52

RETWEETS2

Ross Taylor@rosstaylor90

Speaking at @aiDotEngineer today about long-horizon RL and the different tradeoffs involved. 2:25pm.

Also here in SF for a week if anyone wants to catchup 👋

Ross Taylor@rosstaylor90

Excited to be speaking at @aiDotEngineer World’s Fair alongside partner in crime @ChengxiTaylor.

It seems we have a lot to talk about:

As a bonus, I’ll give a run down of our own RL journey starting from Galactica (2021/2022), early SoTA reasoning efforts at Meta (2023), and more.

Super fun, high signal-to-noise guaranteed. Come join us!

10h11910