One sentence for today's AI developers: "I used to burn tokens; now I'm BAGEN the agents to stop."
🧵 Claude-Opus-4.8 costs you too many tokens - but is this a general problem across agents? Do agents know how much they'll spend?
Introducing Budget-Aware Agents (BAGEN): We study budget awareness across 4 envs & 5 frontier agents, and find structured failures in most of them. 👇