// AutoMem //
I quite like this idea of metamemory.
(bookmark it)
This new research from Stanford treats agent's memory management as a trainable skill instead of a fixed module.
The model decides what to encode, when to retrieve, and how to organize its own notes, with file-system operations promoted to first-class actions right alongside task actions.
AutoMem automates this on two loops. A strong LLM reviews full trajectories and rewrites the memory structure (prompts, schemas, action vocabulary). Then the agent's own good memory decisions across episodes become training signal to sharpen its proficiency.
Optimizing memory alone, without touching task-action behavior, lifts the base agent 2x to 4x on Crafter, MiniHack, and NetHack. That is enough to make a 32B open model competitive with Claude Opus 4.5 and Gemini 3.1 Pro Thinking. For long-horizon agents, memory is a high-leverage objective you can train for on its own.
Paper: https://arxiv.org/abs/2607.01224
Learn to build effective AI agents in our academy: https://academy.dair.ai/



