Prime Intellect introduces General-Agent, a fully synthetic reinforcement learning environment that generates self-evolving tool-use tasks with 4,504 examples across 1,040 domains and 8,159 tools
Tasks advance automatically via synthesizer, solver, and gate components.
Automating RL environments is the next step toward automating everything else.
Introducing general-agent by @mikasenghaas > open agentic environments with 1000s of tools are scarce, so we're building one that builds itself > A synthesizer evolves tasks across difficulty tiers, empirically gated by a solver. Hard tiers seed the next wave, hillclimbing toward frontier-level difficulty. > 4,504 tasks / 1,040 domains / 8,159 unique tools
The next step toward automating AI is automating RL environments Introducing General-Agent: A fully synthetic environment whose task corpus self-evolves and grows harder over time 4,504 tool-use tasks · 1,040 domains · 8,159 unique tools
@mikasenghaas https://www.primeintellect.ai/blog/general-agent
Automating RL environments is the next step toward automating everything else. Introducing general-agent by @mikasenghaas > open agentic environments with 1000s of tools are scarce, so we're building one that builds itself > A synthesizer evolves tasks across difficulty tiers, empirically gated by a solver. Hard tiers seed the next wave, hillclimbing toward frontier-level difficulty. > 4,504 tasks / 1,040 domains / 8,159 unique tools
making the tech that closed labs have open auf giving it to everyone, one release after another :)
The next step toward automating AI is automating RL environments Introducing General-Agent: A fully synthetic environment whose task corpus self-evolves and grows harder over time 4,504 tool-use tasks · 1,040 domains · 8,159 unique tools
making the tech that closed labs have open and giving it to everyone, one release after another :)
The next step toward automating AI is automating RL environments Introducing General-Agent: A fully synthetic environment whose task corpus self-evolves and grows harder over time 4,504 tool-use tasks · 1,040 domains · 8,159 unique tools
go from idea, to environment, to running 1000s of parallel, multi-hour agent episodes within a day.
open superintelligence stack
highly recommend checking out @mikasenghaas full blog post on the general agent environment release with all the details and experiments:
go from idea, to environment, to running 1000s of parallel, multi-hour agent episodes within a day. open superintelligence stack