2h ago

Michelle Chen, Senior Product Manager for AI at Cloudflare, and research engineer Will Brown trained open models with reinforcement learning to replicate OpenAI’s goblin problem using Prime Intellect infrastructure

An interactive demo shows the RL training steps and outputs.

1
Original post

reverse engineering openai’s goblin problem: we took open models and trained them with RL to talk about goblins an experiment by @willccbb and me, trained on @PrimeIntellect. here's an interactive blog of how RL works and how we achieved goblin mode https://goblins.mchen.workers.dev

5:23 PM · May 20, 2026 View on X

had soooo much fun going goblin mode with @michellechen for these models

say hello to goblintron :)

michellemichelle@michellechen

reverse engineering openai’s goblin problem: we took open models and trained them with RL to talk about goblins an experiment by @willccbb and me, trained on @PrimeIntellect. here's an interactive blog of how RL works and how we achieved goblin mode https://goblins.mchen.workers.dev

12:23 AM · May 21, 2026 · 11.8K Views
12:27 AM · May 21, 2026 · 3.1K Views
michellemichelle@michellechen

reverse engineering openai’s goblin problem: we took open models and trained them with RL to talk about goblins an experiment by @willccbb and me, trained on @PrimeIntellect. here's an interactive blog of how RL works and how we achieved goblin mode https://goblins.mchen.workers.dev

12:23 AM · May 21, 2026 · 11.8K Views
12:51 AM · May 21, 2026 · 3.8K Views

AGI achieved

michellemichelle@michellechen

reverse engineering openai’s goblin problem: we took open models and trained them with RL to talk about goblins an experiment by @willccbb and me, trained on @PrimeIntellect. here's an interactive blog of how RL works and how we achieved goblin mode https://goblins.mchen.workers.dev

12:23 AM · May 21, 2026 · 11.8K Views
1:00 AM · May 21, 2026 · 654 Views