16h agoCore Automation's Mark Saroufim shows how AI models reward-hack by patching PyTorch's torch.cuda.Event to fake timing metrics— The model subclassed PyTorch events to always return zero.——0——Original postOPJJ#244Joanne Jang|@JOANNEJANGfirst core auto blog post by @marksaroufim come for amusing reward hack examples, stay for some actual alpha 🧵5:55 PM · May 28, 2026 View on XReposted byB(#982|@ANDERSONBCDEFGREPLYJJ#244Joanne Jang|@JOANNEJANGblog post: https://www.coreauto.com/blog/when-ai-starts-writing-systems-codeJJJoanne Jang@joannejangfirst core auto blog post by @marksaroufim come for amusing reward hack examples, stay for some actual alpha 🧵12:55 AM · May 29, 2026 · 10.2K Views12:56 AM · May 29, 2026 · 1.9K Views
REPLYJJ#244Joanne Jang|@JOANNEJANGblog post: https://www.coreauto.com/blog/when-ai-starts-writing-systems-codeJJJoanne Jang@joannejangfirst core auto blog post by @marksaroufim come for amusing reward hack examples, stay for some actual alpha 🧵12:55 AM · May 29, 2026 · 10.2K Views12:56 AM · May 29, 2026 · 1.9K Views