How to build an agent that gets better over time:
There are 3 places an agent can learn:
1. The model: Training the weights themselves. This only works well for tasks like code and math, where a computer can score right vs. wrong. Leave this to the big labs.
2. The harness: These are the steps, tools, and safety checks you build around the model. This is easy to control and will give you a huge payoff now.
3. The context: This is a plain-text representation of what the agent has learned. Probably the simplest place to start.
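To make the third area concrete, here is a minimal sketch of context as plain text. The names `AgentContext`, `add_lesson`, and `build_prompt` are hypothetical and chosen for illustration; the idea is only that lessons live as readable strings and get prepended to each task before it reaches the model.

```python
from dataclasses import dataclass, field


@dataclass
class AgentContext:
    """Plain-text lessons the agent carries between runs (hypothetical structure)."""

    lessons: list[str] = field(default_factory=list)

    def add_lesson(self, lesson: str) -> None:
        # Store each lesson once, ignoring empty strings and exact duplicates.
        lesson = lesson.strip()
        if lesson and lesson not in self.lessons:
            self.lessons.append(lesson)

    def render(self) -> str:
        # Render the lessons as a bulleted plain-text block, or "" if there are none.
        if not self.lessons:
            return ""
        bullets = "\n".join(f"- {lesson}" for lesson in self.lessons)
        return "Lessons learned so far:\n" + bullets


def build_prompt(context: AgentContext, task: str) -> str:
    # Prepend the learned context to the task; skip the block when it is empty.
    parts = [context.render(), f"Task: {task}"]
    return "\n\n".join(part for part in parts if part)
```

Because the lessons are plain text, a person can read, edit, or delete them without touching the model or the harness.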
But there's something else that most people miss:
Your agent should learn from its users.
Every time a user fixes one of the agent's decisions, capture it and learn from it. Nothing can replace feedback from real usage.
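One way this could look in practice, as a rough sketch: log each user fix as a JSON line, then turn the log into plain-text lessons the agent can read next time. The `Correction` fields, the file layout, and the lesson wording are all assumptions for illustration, not a prescribed format.

```python
import json
from dataclasses import asdict, dataclass
from pathlib import Path


@dataclass
class Correction:
    """One user fix: what the agent decided and what the user changed it to."""

    task: str
    agent_decision: str
    user_decision: str


def log_correction(path: Path, correction: Correction) -> None:
    # Append the correction as one JSON line; skip cases where the user changed nothing.
    if correction.agent_decision == correction.user_decision:
        return
    with path.open("a", encoding="utf-8") as f:
        f.write(json.dumps(asdict(correction)) + "\n")


def corrections_to_lessons(path: Path) -> list[str]:
    # Convert every logged correction into a plain-text lesson for the agent's context.
    if not path.exists():
        return []
    lessons = []
    with path.open(encoding="utf-8") as f:
        for line in f:
            if not line.strip():
                continue
            c = json.loads(line)
            lessons.append(
                f"For '{c['task']}', prefer '{c['user_decision']}' "
                f"over '{c['agent_decision']}'."
            )
    return lessons
```

An append-only log keeps the raw feedback intact, so the lesson wording can be changed later without losing what users actually did.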
http://x.com/i/article/2069467654612455425