/Tech7h ago

Prime Intellect will integrate the ECHO world-modeling training methodology into its open-source prime-rl framework

The technique runs next-token prediction on agent tool calls

225251618051.3K
Original post
Prime Intellect@PrimeIntellect

True agents model the world.

Current training provides no separation between agent and environment: pre-training only trains world modeling, RL only agentic actions. We combine both using ECHO by @DimitrisPapail and @VaishShrivas.

2:14 PM · Jun 10, 2026 · 34.3K Views
Sentiment

Users are glad the ECHO method combining pre-training and RL for world-modeling agents was published, praising it as a nice recipe with promising implications.

Pos
100.0%
Neg
0.0%
2 comments with sentiment.
Cluster Engagement
Posts from X
Most Activity
Most Activity
VIEWS9.7KBOOKMARKS41LIKES109RETWEETS4
samsja@samsja19

Very exciting work to bridge the gap between RL and mid/pretraining

You can learn from your environment beyond the reward signal by doing next token prediction on some of your tool call output

Prime Intellect@PrimeIntellect

True agents model the world.

Current training provides no separation between agent and environment: pre-training only trains world modeling, RL only agentic actions. We combine both using ECHO by @DimitrisPapail and @VaishShrivas.

7hViews 9.7KLikes 109Bookmarks 41
REPLIES3

"ECHO is a promising technique that seems to work at scale. We believe that it will soon become an important part of open model training [...] Therefore, we will soon support it in a highly flexible and performant manner in prime-rl."

Thank you @PrimeIntellect ❤️

Prime Intellect@PrimeIntellect

True agents model the world.

Current training provides no separation between agent and environment: pre-training only trains world modeling, RL only agentic actions. We combine both using ECHO by @DimitrisPapail and @VaishShrivas.

7hViews 3.7KLikes 68Bookmarks 13
Prime Intellect@PrimeIntellect

We show strong results in the under-resourced programming language Forth and evaluate generalization to unrelated environments.

We also characterize what aspects of an environment lead to overfitting when using ECHO, how model behavior is impacted, and much more.

7hViews 824Likes 32Bookmarks 2
Prime Intellect@PrimeIntellect

Read more:

https://www.primeintellect.ai/blog/true-agents-model-the-world/

7hViews 521Likes 16Bookmarks 4
Prime Intellect@PrimeIntellect

By performing SFT on tool outputs and RL on the assistant tokens, we can efficiently teach the model the environment dynamics. This happens on-policy: the LLM models the environment not in a vacuum but in response to its own actions.

7hViews 869Likes 34
stochasm@stochasticchasm

love to see it!

Prime Intellect@PrimeIntellect

True agents model the world.

Current training provides no separation between agent and environment: pre-training only trains world modeling, RL only agentic actions. We combine both using ECHO by @DimitrisPapail and @VaishShrivas.

7hViews 3.6KLikes 30Bookmarks 4
Sinatras@myainotez

Great blogpost from prime, envs are about gain a whole new side usecase

Prime Intellect@PrimeIntellect

True agents model the world.

Current training provides no separation between agent and environment: pre-training only trains world modeling, RL only agentic actions. We combine both using ECHO by @DimitrisPapail and @VaishShrivas.

7hViews 1.4KLikes 13Bookmarks 1
Andrea Miele@andreamiele_

@DimitrisPapail @PrimeIntellect Could you please release the training set for the ECHO paper ? it’s a really nice post training recipe :)

7hViews 8
Maxence Frenette@maxencefrenette

@PrimeIntellect @DimitrisPapail @VaishShrivas I'm so glad someone finally tried this out and published the results, thank you! This has big implications if scaled up sufficiently.

7hViews 1