We are excited to share our latest work, "Superhuman Safe and Agile Racing through Multi-Agent Reinforcement Learning," done in collaboration with @GoogleDeepMind . Autonomous drones have reached superhuman speed in isolation, but what happens when multiple agents share the same airspace?
Paper: https://arxiv.org/abs/2605.22748
Website: https://rpg.ifi.uzh.ch/marl
Video: https://youtu.be/TSwtrHQgjD8
Using league-based self-play, we train #ReinforcementLearning agents that race against a diverse, evolving population of opponents. Through this competitive training, sophisticated behaviors emerge without explicit programming: strategic overtaking, proactive collision avoidance, and even awareness of aerodynamic downwash from nearby drones. In real-world multi-player races at speeds exceeding 80kph (50 mph) and accelerations up to 7g, our agents outperform a five-time Swiss national drone racing champion while reducing collision rates by 50% compared to single-agent baselines. Crucially, training against diverse artificial opponents enables zero-shot generalization to human pilots, achieving over 90% race completion in mixed human-AI races with up to four competitors.
A key insight: human pilots adopt riskier strategies when trailing, leading to more crashes under competitive pressure. Our learned policies, by contrast, maintain consistent safety margins regardless of race standing, a property essential for deploying autonomous systems alongside humans. Also, the multi-agent self-play policies are more robust than those trained independently, suggesting that training in competitive environments is not only key to winning races but also to learning safer, more reliable autonomy for real-world multi-robot systems.
Kudos to Ismail Geles, Leonard Bauersfeld, Markus Wulfmeier! @isgeles @l_bauersfeld @m_wulfmeier
@ERC_Research @uzh_ifi @UZH_en
@UZH_Science @UZHspacehub @swissrobotics @nccrrobotics
