/Tech7h ago

Prime Intellect will integrate the ECHO world-modeling training methodology into its open-source prime-rl framework

The technique runs next-token prediction on agent tool calls

225251618051.3K

#203

Original post

Prime Intellect@PrimeIntellect

True agents model the world.

Current training provides no separation between agent and environment: pre-training only trains world modeling, RL only agentic actions. We combine both using ECHO by @DimitrisPapail and @VaishShrivas.

2:14 PM · Jun 10, 2026 · 34.3K Views

/Tech7h ago

Prime Intellect will integrate the ECHO world-modeling training methodology into its open-source prime-rl framework

The technique runs next-token prediction on agent tool calls

225251618051.3K

#203

Original post

Prime Intellect@PrimeIntellect

True agents model the world.

2:14 PM · Jun 10, 2026 · 34.3K Views

Sentiment

Users are glad the ECHO method combining pre-training and RL for world-modeling agents was published, praising it as a nice recipe with promising implications.

Pos

100.0%

Neg

0.0%

2 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

VIEWS9.7KBOOKMARKS41LIKES109RETWEETS4

samsja@samsja19

Very exciting work to bridge the gap between RL and mid/pretraining

You can learn from your environment beyond the reward signal by doing next token prediction on some of your tool call output

Prime Intellect@PrimeIntellect

True agents model the world.

7h9.7K10941

REPLIES3

Dimitris Papailiopoulos@DimitrisPapail

"ECHO is a promising technique that seems to work at scale. We believe that it will soon become an important part of open model training [...] Therefore, we will soon support it in a highly flexible and performant manner in prime-rl."

Thank you @PrimeIntellect ❤️

Prime Intellect@PrimeIntellect

True agents model the world.

7h3.7K6813

Prime Intellect@PrimeIntellect

We show strong results in the under-resourced programming language Forth and evaluate generalization to unrelated environments.

We also characterize what aspects of an environment lead to overfitting when using ECHO, how model behavior is impacted, and much more.

7h824322

Prime Intellect@PrimeIntellect

https://www.primeintellect.ai/blog/true-agents-model-the-world/

7h521164

Prime Intellect@PrimeIntellect

By performing SFT on tool outputs and RL on the assistant tokens, we can efficiently teach the model the environment dynamics. This happens on-policy: the LLM models the environment not in a vacuum but in response to its own actions.

7h86934

stochasm@stochasticchasm

love to see it!

Prime Intellect@PrimeIntellect

True agents model the world.

7h3.6K304

Sinatras@myainotez

Great blogpost from prime, envs are about gain a whole new side usecase

Prime Intellect@PrimeIntellect

True agents model the world.

7h1.4K131

Andrea Miele@andreamiele_

@DimitrisPapail @PrimeIntellect Could you please release the training set for the ECHO paper ? it’s a really nice post training recipe :)

7h8

Maxence Frenette@maxencefrenette

@PrimeIntellect @DimitrisPapail @VaishShrivas I'm so glad someone finally tried this out and published the results, thank you! This has big implications if scaled up sufficiently.

7h1