23h ago

Fine-Tuning Experiments Reveal Inconsistent Backdoor Removal in Llama Models

Original post

can any interp folks working on understanding fine-tuning explain these results to me

6:20 PM · May 18, 2026