/Tech23h ago

Build Self-Learning AI Agents With Model, Harness, Context And User Feedback

341.4K2461.9K120.2K

#1889

Original post

Santiago@svpino#1889inTech

How to build an agent that gets better over time:

There are 3 areas an agent can learn from:

1. The model: Only works for code and math, where a computer can score right vs. wrong. Leave this to the big labs.

2. The harness: These are the steps, tools, and safety checks you build around the model. This is easy to control and will give you a huge payoff now.

3. The context: This is a plain-text representation of what the agent has learned. Probably the simplest place to start.

But there's something else that most people miss:

Your agent should learn from its users.

You want to learn from every time a user fixes the agent's decision. Nothing can replace feedback from real usage.

Atai Barkai@ataiiam

http://x.com/i/article/2069467654612455425

11:20 AM · Jun 25, 2026 · 120K Views

Sentiment

Positive users praise self-learning AI agents using user feedback loops and harness refinements as an underrated moat and major advance, while negative users note costs from safety layer false positives.

Pos

90.9%

Neg

9.1%

11 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Related links

2069467654612455425

X.COMVia

Posts from X

Most Activity

Ali Sherief@Zenul_Abidin

@svpino The only way an agent can take feedback from it's users is by using said feedback to train a new version of the model, OR train some sort of bootstrap model who's only job is to take the input prompt and refine it with the feedback it was trained on.

22h389

BOOKMARKS1

安叫兽|Bird🕊️ 🔶 BNB@ajs6888

@svpino 第三点被时间线吃掉了吗

19h1611

LIKES1

Nick Venturi@nickventuri

@svpino most people just prompt and pray

15h1381

RETWEETS147

Santiago@svpino

How to build an agent that gets better over time:

There are 3 areas an agent can learn from:

1. The model: Only works for code and math, where a computer can score right vs. wrong. Leave this to the big labs.

2. The harness: These are the steps, tools, and safety checks you build around the model. This is easy to control and will give you a huge payoff now.

3. The context: This is a plain-text representation of what the agent has learned. Probably the simplest place to start.

But there's something else that most people miss:

Your agent should learn from its users.

You want to learn from every time a user fixes the agent's decision. Nothing can replace feedback from real usage.

Atai Barkai@ataiiam

http://x.com/i/article/2069467654612455425

23h120K1.4K1.9K

REPLIES1

Yann Ribemont@YannRibemont

@svpino @threadreaderapp unroll

12h25

Farhad Nawab@FarhadNawab

@svpino the learn from user corrections point is the one most teams never get to because they're still arguing about which model to use.

17h1371

Samuel Mwania@mwaaniasamuel

The point that nobody watches the user might be the most underrated idea in agent design right now. Everyone obsesses over the model and harness, but the moment a user corrects the agent is the richest signal you'll ever get, and most systems just throw it away. This reframed how I think about my own builds. Thank you for laying it out so clearly.

23h122

JY@JacksonYou668

@svpino cool

15h109

Dipanshu Kushwaha@Dipanshu_AI

@svpino Sounds like a solid roadmap! Learning from various areas is key to growth. Excited to see where this journey leads!

15h102

Carl Xien@CarlXien

@svpino training with real human interactions and decisions Nice

13h75

Rimsha Bhardwaj@heyrimsha

@svpino Balancing model development with practical user interactions truly makes a difference.

13h71

Michał Piszczek@cdiamond

@svpino the model layer is the one everyone fixates on and can't move. harness and context are the only levers you own in prod

22h62

Syntax Bloom@SyntaxBloomX

@svpino the user-feedback loop is the closest thing to an ltv moat in the agent space. every correction is a labeled data point that your competitor's agent doesn't have

19h61

Adel Bucetta@adelbucetta

@svpino the honest answer is that most people get tripped up in the harness, thinking the model is where the real learning happens, but that's just the easy part. the hard part is figuring out how to connect the model's output to real-world actions without breaking everything.

20h35

John Cramer@CramerCronicles

@svpino Appreciate it

12h25

Blum@Blum_OG

@svpino developing agents that improve over time is probably one of the biggest jumps in AI this year

12h25

Thread Reader App@threadreaderapp

@YannRibemont @svpino @YannRibemont Hello, the unroll you asked for: https://threadreaderapp.com/thread/2070210421995569537.html Have a good day. 🤖

12h19

Uncle J@UncleJAI

@svpino The harness is where most teams can actually improve the agent.

The model is mostly someone else's roadmap.

But the checks, tools, task boundaries, feedback capture, and recovery path are yours. That's where compounding starts.

18h19

Dmitrii Malakhov@malakhovdm

@svpino The harness learning loop is clean in theory until the safety layer's false positives cost more than any model improvement.

15h10

AI Mastery Guide@aiseomastery

@svpino Learning from every time a user corrects it is such an obvious signal but barely anyone builds for it.

12h6