/AI9h ago

Clarisse Wibault introduces Recurrent Structural Policy Gradient to solve partially observable Mean Field Games with faster convergence than model-free RL

The approach avoids scalability limits of traditional dynamic programming.

--0--
Original posts
Quote posts
Reposts
Original postJakob Foerster#139
Clarisse Wibault@ClarisseWibault

Mean Field Games provide a framework for modelling large populations. ICML26 Spotlight: Introducing Recurrent Structural Policy Gradient for partially observable MFGs with common noise, benefitting from faster convergence than model-free RL, but remaining tractable, unlike DP.

7:51 AM · Jun 1, 2026 · 11.3K Views
Sentiment
Sentiment unavailable for this story.
Cluster Engagement
-
Views
-
Comments
-
Reposts
-
Bookmarks
Expand data
Posts from X
Most Activity
Most ActivityTimeline
VIEWS8.7KBOOKMARKS55LIKES49RETWEETS9REPLIES3
Ben Moll@ben_moll

I absolutely loved this interdisciplinary collaboration with the brilliant @ClarisseWibault, @j_foerst and many others at @FLAIR_Ox!

"Recurrent Structural Policy Gradient for Partially Observable Mean Field Games"

Clarisse Wibault@ClarisseWibault

Mean Field Games provide a framework for modelling large populations. ICML26 Spotlight: Introducing Recurrent Structural Policy Gradient for partially observable MFGs with common noise, benefitting from faster convergence than model-free RL, but remaining tractable, unlike DP.

8hViews 8.7KLikes 49Bookmarks 55
Clarisse Wibault introduces Recurrent Structural Policy Gradient to solve partially observable Mean Field Games with faster convergence than model-free RL · Digg