Excited to share our Learn2Match benchmark for dynamic matching markets. @HisaishiJo2521 @boyang_boe_zhou @natashajaques
In real markets, feedback is delayed: a worker’s fit is revealed only after joining. Our results suggest strong potential for RL agents for such scenarios.
Can multi-agent reinforcement learning help study stable matching? In real matching markets (jobs, dating, school choice), fit unfolds over time. So why do we study matching as if it were static? We introduce Learn2Match: a MARL benchmark for dynamic matching.