Harbor is really great. I like the design and it's well polished for doing evals. It would be great to use the same rollout utility for everything (RL / eval / new tasks definitions).
Rollouts for eval, rollouts for RL, rollouts for GEPA, rollouts for prod, rollouts for trajectory analysis, rollouts for SFT data gen, rollouts rollouts rollouts


