@teortaxesTex moving it all here (or something adjacent)
π£π£ Meet Qwen-AgentWorld β a native language world model that simulates 7 agent environments (MCP, Search, Terminal, SWE, Web, OS, Android) within a single model. Environment modeling is the training objective from day one, not a post-hoc adaptation.
π€ LLMs are trained to be better agents β better at acting in environments. But nobody has trained them to model the environments themselves.
πΊοΈ Our roadmap: investigate how language world modeling can push the boundaries of general agent capabilities, along two routes:
1οΈβ£ Build a foundation model for environment simulation β outperforming Claude Opus 4.8 and GPT-5.4 on AgentWorldBench
2οΈβ£ Investigate how world modeling enhances agent training: π¬ Controllable Sim RL (agentic RL with LWM as environments) surpasses training in real environments π§ Learning to predict environments (LWM warm-up) makes agents stronger β remarkably, even without any agent-specific training, this predictive knowledge transfers to agentic tasks with zero fine-tuning
π Paper: https://arxiv.org/abs/2606.24597 π Blog: https://qwen.ai/blog?id=qwen-agentworld π» GitHub: https://github.com/QwenLM/Qwen-AgentWorld π€ HuggingFace: https://huggingface.co/collections/Qwen/qwen-agentworld π§© ModelScope: https://modelscope.cn/collections/Qwen/Qwen-AgentWorld



