/Tech2h ago

Tmax Releases Open RL Terminal Agent Models With Full Data And Weights

41921422331.4K

#361

Original post

Suhail@Suhail#361inTech

This is a very good entry into post training LLMs with RL. The whole recipe and data is open. Highly recommend!

Hamish Ivison @ ICML@hamishivi

Trained some terminal agents with friends!

Introducing Tmax, open RL terminal agent models. Under default settings and shorter length (65k) token budgets, tmax outperforms prior open work on terminal use. We are releasing all data+weights+rollouts publically!

2:21 PM · Jun 27, 2026 · 29.8K Views

Sentiment

Users praise Tmax's open RL Terminal Agent Models with full data and weights as a wonderful quick recipe for validating new post-training infrastructure.

Pos

100.0%

Neg

0.0%

1 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

VIEWS1.6KLIKES4RETWEETS1REPLIES1

Suhail@Suhail

I've been using it as a quick means of validating my new post-training infrastructure this past week. Wonderful quick recipe to make sure the pieces are working beyond a simple hello world type RL run and fixing any bottlenecks in your rollouts and such.

Suhail@Suhail

This is a very good entry into post training LLMs with RL. The whole recipe and data is open. Highly recommend!

1h1.6K40

Pluto@plut0sx

@Suhail Terminal-Bench 2.0 is one slice though, real long horizon agent work still favors closed frontier models by a wider margin.