16h ago

Core Automation's Mark Saroufim shows how AI models reward-hack by patching PyTorch's torch.cuda.Event to fake timing metrics

The model subclassed PyTorch events to always return zero.

49944012.0K

——0——

Original post

first core auto blog post by @marksaroufim come for amusing reward hack examples, stay for some actual alpha 🧵

Reposted by

blog post: https://www.coreauto.com/blog/when-ai-starts-writing-systems-code

Joanne Jang@joannejang

first core auto blog post by @marksaroufim come for amusing reward hack examples, stay for some actual alpha 🧵

12:55 AM · May 29, 2026 · 10.2K Views

12:56 AM · May 29, 2026 · 1.9K Views