New Paper Uses Agent Exploitation and Patching to Fix LLM Reward Hacks · Digg