Finally had the chance to give prime-rl a try...
It is very, very nice! Super easy to get experiments running, well-balanced logs that keep you up-to-date but don't drown you in warnings, amazing eval visualisation, efficient training...
Did not try the hosted training, just local single-GPU experiments. I've tried a couple of frameworks over the last couple of months, and somehow this one hits the sweet spot between easy configurability and the level of detail you need to "be in control" (and not discover some unintuitive defaults after a few weeks).
Great work @PrimeIntellect


