/Tech2d ago

Stanford's Rylan Schaeffer highlights research showing AI agents perform worse over time when self-reflecting and updating their own prompts

Academic Jason Lee suggested using basic prompts to continue instead

--0--

#111

Original post

Rylan Schaeffer@RylanSchaeffer#535inTech

@jasondeanlee There's a nice paper I might be able to dig up about how asking agents to reflect on trajectories and update with learnings yields *worse* performance over time

Just because models can update their own prompts doesn't mean they're good at doing so

Jason Lee@jasondeanlee

I am convinced I am still useful. I spent about 10 days trying to write skill files and harness in codex with gpt 5.5 pro to replace myself : query gpt 5. 5 pro, sanity check, and ask it to rewrite or clarify or suggest vague new strategies. The results are not very good. I am still better at prompting than the harness

5:11 PM · Jun 14, 2026 · 913 Views

Sentiment

Sentiment building, check back later.

Cluster Engagement

Views

Comments

Reposts

Bookmarks

Expand data

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

VIEWS703BOOKMARKS1LIKES1

Jason Lee@jasondeanlee

@RylanSchaeffer Interesting. I think I can delete my harness and just have it prompt with 'continue'

Rylan Schaeffer@RylanSchaeffer

@jasondeanlee There's a nice paper I might be able to dig up about how asking agents to reflect on trajectories and update with learnings yields *worse* performance over time

Just because models can update their own prompts doesn't mean they're good at doing so

2d70311