Too many RL ideas die at the edge of the LLM/VLM/VLA training stack. Not anymore.
With FeynRL, new algorithm ideas do not have to fight the whole stack 🚀. Focus on the algorithm while still training very large models.
https://github.com/FeynRL-project/FeynRL
Try it, 🌟 it, send feedback.