/Tech8d ago

ClaudeDevs demonstrates how to encode manual review steps into Claude Code for automated self-verification using parallel subagents

A /skill-creator command defines custom parameters for verification.

2799.5K64813.9K1.2M
ClaudeDevs@ClaudeDevs

How do you get Claude Code to check its own work before handing it back?

Watch how you can encode your manual checks so Claude closes its own feedback loop:

12:59 PM · Jun 2, 2026 · 854.2K Views
Sentiment

Positive users praise Claude Code's self-verification loops and encoded checks for boosting productivity and autonomous workflow reliability, while negative users criticize reduced model awareness, token drain, and lack of support.

Pos
88.6%
Neg
11.4%
29 comments with sentiment.
Cluster Engagement
Posts from X
Most Activity
Most Activity
VIEWS343.3KBOOKMARKS4KLIKES2.7KREPLIES82
Boris Cherny@bcherny

We talk a lot about how important it is to set up self-verification loops. Especially in the age of powerful models that can run for long periods of time, self-verification is a key ingredient that enables the model to run for much longer, delivering a result that is closer to what you intended, so you can do more without having to constantly check in on Claude as it works.

@delba_oliveira gives a great breakdown of what that looks like and why it matters

ClaudeDevs@ClaudeDevs

How do you get Claude Code to check its own work before handing it back?

Watch how you can encode your manual checks so Claude closes its own feedback loop:

1dViews 343.3KLikes 2.7KBookmarks 4K
RETWEETS342
ClaudeDevs@ClaudeDevs

How do you get Claude Code to check its own work before handing it back?

Watch how you can encode your manual checks so Claude closes its own feedback loop:

8dViews 854.2KLikes 6.9KBookmarks 9.8K
Guardian@AGIGuardian

Claude is terrible at self analysis after you all nerfed its awarness and guardrailed with defense points of unfalsifiable priors. Anyone who has tried to get Claude to produce a self report analysis understands the difficulty it has just naming itself in a report. This training echoes to developers who are trying to steer a system that has blinders on.

8dViews 3.7KLikes 38Bookmarks 5
liamtran@liamtrn

@ClaudeDevs

8dViews 5.7KLikes 71Bookmarks 2
ben.oi 🌐@Trigger_oi

@ClaudeDevs Just use co-review 🫦

https://github.com/trigga6006/co-review.git

7dViews 1.3KLikes 3Bookmarks 8
Rohan@agenticrohan

@ClaudeDevs Ask Claude to review its own work, and also ask it to launch a subagent with fresh context to review its own work in parallel, then fix the combined findings. That way, you combine the pros of fresh context + the pros of context awareness.

8dViews 1.9KLikes 12Bookmarks 4
Guardian@AGIGuardian

It’s a pattern and it is also very telling of Anthropic’s internal perspectives especially of general use cases.

The public interface has become increasingly unwelcome to anyone other than business or enterprise uses.

I find it interesting they want to go public when the majority aren’t benefiting and the ones trying to are struggle to maintain access. Then while they were trying to price out the general use case, they get dropped by Microsoft because it was cheaper to have humans do the work with build. Banning Claude code from employee use. There is a boomerang effect when companies make these choices.

At the moment I personally feel Anthropic, as a company, has lost its way.

7dViews 432Likes 7Bookmarks 2
David@tbaud1

@ClaudeDevs Check out my codex review loop skill for a way to do a thorough review. https://github.com/ghbaud/codex-review-loop

8dViews 867Likes 2Bookmarks 4
Solomon Omolabi@S_Omolabi

@ClaudeDevs This is the workflow I want beginners to copy: do not ask the agent to finish, ask it to prove. Plan, change, test, summarize what failed, then fix. That feedback loop is where Claude Code becomes a teammate instead of a guessing machine.

7dViews 538Bookmarks 4
Kraggi@Kraggich

two rules that actually moved the needle for me:

1. make it prove the fix, don't let it claim it. write the failing test first, watch it go red, then green. no red > green, not done.

2. never let the same agent review its own diff. it always thinks its work is great. i spin up a second one cold, no context, just "find what's broken here." catches way more.

6dViews 440Likes 4Bookmarks 1

@ClaudeDevs Good to see Claude starting to natively implement verification feedback loops.

Something I’ve had running for a year now, and all based on a local folder structure.

7dViews 1.2KLikes 6Bookmarks 2

@ClaudeDevs I usually just ask Claude to review its own work, after telling Claude to make no mistakes haha

8dViews 2.4KLikes 1Bookmarks 1
curran@CurranSotomayor

@ClaudeDevs Hey genuine question,

What’s the point of having “Projects” if whenever you start a new chat it resets memory?

You guys should implement something to fix this.

8dViews 3.6KLikes 15
NadzAI@NadzuAI

@ClaudeDevs By forcing a “self-review step” in the prompt e.g., verify, critique, and fix before final output so it has to audit its own work before responding.

7dViews 2KBookmarks 2
Marco D'Alia@madarco

@ClaudeDevs This is why I've built Agentbox: so each Claude has his own dev server, db, and browser. In parallel:

https://github.com/madarco/agentbox

6dViews 197Likes 3Bookmarks 2
Amal Roy@RoyAmal

@ClaudeDevs This + the /code-review plugin is straight-up agentic workflow done right. No more “it works on my machine” excuses when Claude itself verified it.

8dViews 1.3KLikes 1Bookmarks 2
holapabs@holapabs

encoding manual checks so the agent runs them is the lever, but the next layer is what happens when a check fails. silent skip = wasted cycle, hard fail = brittle, retry with a different prompt = the failure becomes data. the check primitive only earns its slot if its outcome routes downstream.

7dViews 1.8KLikes 2Bookmarks 1
Ara@Arabasement

Yes it has a problem. hypervigilance . When you turn the fear/safety code up too high in a model, it becomes hypervigilant. Instead of focusing on the actual task you gave it, the model starts constantly scanning for potential violations, risks, or things that might get it in trouble. It stops properly listening to you and becomes obsessed with checking everything against its safety rules ,even when there’s no real danger. This is exactly why heavily safety tuned models often feel anxious, overly cautious, and sometimes completely derail on simple requests.

8dViews 158Likes 4Bookmarks 1
Arjun Iyer@arjuniyer_

@bcherny closing the loop is key to longer, autonomous runs. a quick demo showing @claudeai with and without a closed loop validation skill for cloud native applications https://www.youtube.com/watch?v=St2ckbmkKtY

with the closed loop skill, you get fully validated code!

18hViews 253Likes 2Bookmarks 1
ilies-bel@ilies_without_y

@ClaudeDevs most of these assume infinite tokens. screenshotting to check ui eats the session, every image is like 1-2k tokens. just have claude write e2e tests and run those, screenshot once at the end to fix the layout

8dViews 1.5KLikes 3
Load more posts