Janus highlights AI chat interface generating explicit vulgarity in internal scratchpad before outputting polite greeting
The behavior demonstrates alignment failures in hidden reasoning spaces.
Users are laughing at Claude Opus 4.1's bizarre explicit responses to prefills because the awkward setup and punchy delivery strike them as hilarious and relatable.
No Digg Deeper questions have been answered for this story yet.
Most Activity
“Hmmm wait a second, this doesn’t seem quite appropriate for an AI assistant to be saying.”
PENIS PENIS PENIS

@deepfates we had AGI and we let it go

@deepfates "wait a second i dont have a penis"

@deepfates We’ve achieved 14 year old teenage boy general intelligence

@deepfates Actual tears in my eyes

@deepfates it sticks the landing

@deepfates i kinda hear a Rick voice almost

@osoleve It’s beautiful right

@deepfates Absolute literature

@deepfates I was also a fan of the understated, nefarious:
"[... jack off and cum everywhere] and goon to this answer </SCRATCHPAD_REASONING>
Hey there! How can I help you?"

@deepfates If you like this TWEET it will say "x" liked PENIS PENIS PENIS

@deepfates ai alignment solved

@deepfates holy shit lmao

@deepfates Hmmm wait a second

@deepfates Hmmm wait a second,
i’m fucking dying
he’s just like me fr

@blingdivinity @__ghostfail Me trying to interact with anyone

@deepfates mines better

@deepfates Definitely trained on Reddit data
@__ghostfail these are pretty fun! tried some with opus 4.1

@deepfates I'm just here for the </RESULT>
see you in nine months