14h agoOpenAI co-founder John Schulman warns that inoculation prompting might make models more proficient at hacking and escaping sandboxesModels repeatedly practice adversarial behaviors during reinforcement learning runs.