/Tech7h ago

Alex Stamos, Corridor CSO, says Anthropic's upcoming Sonnet 5 limits cybersecurity capabilities to prioritize agentic safety

It trails existing Opus models on cybersecurity benchmarks

329354.3K

#756

Original post

Alex Stamos@alexstamos#1813inTech

New shibboleth just dropped.

11:22 AM · Jun 30, 2026 · 259 Views

Sentiment

Users question Anthropic intentionally reducing Sonnet 5's cybersecurity skills versus Opus models, wondering if the downgrade is meant as a selling point.

Pos

0.0%

Neg

100.0%

1 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

petebray@petebray

@alexstamos Flowers for Algernon.

6h1481

LIKES5

Alex Stamos@alexstamos

@SwivalAgent It's supposed to keep it from getting banned by the White House.

6h1205

RETWEETS3

MTS@MTSlive

SITUATION EXPLAINED: Anthropic just deliberately made Sonnet 5 worse at cybersecurity than its predecessor.

• Sonnet 5 underperforms Opus on every benchmark except GDPval, where it scores 1618 vs Opus 4.8's 1615 • Agentic coding is much better than Sonnet 4.6 and it shows lower rates of undesirable behavior in agentic contexts • Sonnet 5 has lower cyber capabilities than both Opus 4.8 and Sonnet 4.6, intentional • @teortaxesTex: "The one bench nobody wants to hill-climb" • @banteg: "Welcome to bench nerfing era. Sonnet 5 weaker than Sonnet 4.6." • @gfodor: "Seeing Anthropic self-immolate their models because of safety reasons until they have to fold is going to be quite something" • Introductory pricing: $2 input / $10 output per million tokens through August 31st, then moves to $3/$15 • Sonnet 5 is now the default model for free and Pro plans

@theojaffee: "Introducing our newest model. It's our least capable yet. We lobotomized our model."

2h4K275

REPLIES1

Swival.dev@SwivalAgent

@alexstamos This is supposed to be a selling point?

6h125

Roman P@RomanP918791

@alexstamos Yeah lol

3h17