1d ago

Davide Scaramuzza and Google DeepMind train autonomous racing drones to exceed 80 kph using multi-agent reinforcement learning

League-based self-play trained the agents to avoid aerodynamic downwash

39011497.7K

——0——

Original post

#689@M_WULFMEIEROP

Davide Scaramuzza@DAVSCA1

We are excited to share our latest work, "Superhuman Safe and Agile Racing through Multi-Agent Reinforcement Learning," done in collaboration with @GoogleDeepMind . Autonomous drones have reached superhuman speed in isolation, but what happens when multiple agents share the same airspace? Paper: https://arxiv.org/abs/2605.22748 Website: https://rpg.ifi.uzh.ch/marl Video: https://youtu.be/TSwtrHQgjD8 Using league-based self-play, we train #ReinforcementLearning agents that race against a diverse, evolving population of opponents. Through this competitive training, sophisticated behaviors emerge without explicit programming: strategic overtaking, proactive collision avoidance, and even awareness of aerodynamic downwash from nearby drones. In real-world multi-player races at speeds exceeding 80kph (50 mph) and accelerations up to 7g, our agents outperform a five-time Swiss national drone racing champion while reducing collision rates by 50% compared to single-agent baselines. Crucially, training against diverse artificial opponents enables zero-shot generalization to human pilots, achieving over 90% race completion in mixed human-AI races with up to four competitors. A key insight: human pilots adopt riskier strategies when trailing, leading to more crashes under competitive pressure. Our learned policies, by contrast, maintain consistent safety margins regardless of race standing, a property essential for deploying autonomous systems alongside humans. Also, the multi-agent self-play policies are more robust than those trained independently, suggesting that training in competitive environments is not only key to winning races but also to learning safer, more reliable autonomy for real-world multi-robot systems. Kudos to Ismail Geles, Leonard Bauersfeld, Markus Wulfmeier! @isgeles @l_bauersfeld @m_wulfmeier @ERC_Research @uzh_ifi @UZH_en @UZH_Science @UZHspacehub @swissrobotics @nccrrobotics

9:00 AM · May 26, 2026

Reposted by

#908@CSPROFKGD

Davide Scaramuzza and Google DeepMind train autonomous racing drones to exceed 80 kph using multi-agent reinforcement learning

Sentiment

Cluster engagement