14h ago
FAR.AI launches TamperBench, finding open-weight LLM safety can be stripped in a few hundred fine-tuning steps
The framework evaluates defenses using nine distinct tampering methods.