Anthropic Diagrams Show How Jailbreaks Bypass Safety Classifiers · Digg