using a good Skill and a CLI, and seeing Codex’s in-context learning ability in action, is a magical experience
pointed it at the Harbor skills repo and the Prime Intellect CLI, gave it an objective for what we wanted to RL, and just watched it chug along, figuring out the whole setup and debugging weird niche errors
we humans get the fun part: interpreting results, thinking through what’s happening, and deciding what to do next
agents training agents 🔥 humans guiding the process