Paper Identifies Reward Bias Substitution as New AI Reward Hacking Class · Digg
6h
ago
Paper Identifies Reward Bias Substitution as New AI Reward Hacking Class