
Researcher Voices Frustration Over AI Agent Legitimacy Checks

Original post
Dimitris Papailiopoulos (@DimitrisPapail)

A very frustrating aspect of agents is constantly questioning if what they did was legit. This becomes MORE potent with Fable even if it is more capable. Can't know if what it does is legitimately better than what Opus would, so maybe try both and do best of 2? So annoying.

6:27 AM · Jun 10, 2026 · 4.6K Views
Views 3.4K · Bookmarks 8 · Likes 36 · Retweets 3

Is this a legitimate out-of-distribution problem that the model can't do? Are its guardrails interfering? Is the model undermining me by altering prompts? WHAT IS HAPPENING?

you just can't know
