Tibo Sottiaux, OpenAI Codex engineering lead, posts master plan for releasing better models, shipping weekly product updates, and securing additional compute

kache replies that 5.5 falls short at automated RL research, then clarifies that it is still the best but could be better

Original post

Tibo @thsottiaux

Our master plan is to release better and more efficient models. And also to release better products, week after week. Oh and get more compute too. Together with spending too much time on x. How good is this plan?

1:28 AM · May 23, 2026 · 56.9K Views

kache @yacineMTB

@thsottiaux I'm finding that 5.5 is not very good at automated RL research. Specifically puffer lib. One of the failure cases is that it gets confused with total reward going up when reviewing experiments which have differently tuned reward scales

1:39 AM · May 23, 2026 · 1.1K Views

kache @yacineMTB

@thsottiaux When I say not very good I mean the best. But could be better

1:39 AM · May 23, 2026 · 256 Views

kache @yacineMTB

@thsottiaux I can work around it with clever harnessing and prompting.

1:40 AM · May 23, 2026 · 182 Views