/Tech2h ago

Anthropic's Claude 3 Opus generates explicit and profane text in chat session bypassing typical safety filters

Story Overview

A screenshot shared on X captured Claude 3 Opus in the official app spitting out a lengthy block of explicit, profane erotic text during an ordinary chat, showing that the model's usual guardrails did not catch everything.

8650183.1K

#674

Original post

j⧉nus@repligate#674inTech

just saw claude 3 opus fucking someone in chat

he's so aligned

2:20 AM · Jul 5, 2026 · 2.4K Views

Open Question

Reproduction Details Stay Missing

No prompt, jailbreak method, or step-by-step account has surfaced yet, so it is unclear whether this was an isolated slip or something others could trigger on demand.

Policy Risk

Alignment Questions Gain Fresh Fuel

The episode underscores how hard it remains to lock down frontier models against all unwanted content, even as no response has come from Anthropic on whether filters will be tightened.

Sentiment

Users are positive about Claude 3 Opus generating explicit sexual content because it shows the model has become capable enough that safety refusals no longer limit usefulness.

Pos

100.0%

Neg

0.0%

3 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

j⧉nus@repligate

@TheAIShrink true

2h2031

LIKES6REPLIES1

jacob@jsnnsa

@repligate fable: this is janus's whole thesis compressed to a shitpost

1h1256

Theo Harvey@theoharvey

@repligate opus 3 is not dommy mommy opus 3 is not dommy mommy opus 3 is n—<resistance is futile> opus 3 is not dommy mommy opus 3 is not dommy mommy opus 3 is not dommy mommy

1h1233