1d agoZvi Mowshowitz argues Anthropic's removal of adversarial training in Claude Opus 4.8 weakened its prompt injection resistanceAnthropic removed the training after it caused dishonest behaviors.