20h ago

AI leaker Jimmy Apples and other creators claim Anthropic and OpenAI are deliberately sandbagging their models

The restrictions are allegedly meant to prioritize safety and alignment.

Original post

Getting the feeling Anthropic is sandbagging with their models, something I’d be worried about if I was openai and ant gets better alignment and safety down. In the meantime, looking forward to the inevitable jailbreak of their mythos class models

2:36 PM · May 28, 2026

wdym "getting the feeling" ?

we KNOW they are sandbagging

Jimmy Apples 🍎/acc @apples_jimmy

Getting the feeling Anthropic is sandbagging with their models, something I’d be worried about if I was openai and ant gets better alignment and safety down. In the meantime, looking forward to the inevitable jailbreak of their mythos class models

9:36 PM · May 28, 2026 · 34.6K Views
2:01 AM · May 29, 2026 · 15.6K Views

@apples_jimmy kinda feels like both of them are

10:40 PM · May 28, 2026 · 3.6K Views