RL Value Functions Act as Supermartingale Certificates for Stochastic Verification

Original post

Very proud to have funded this work in my previous role @ARIA_research. I claimed that in environments with formal world-models, RL can be used to generate proof-carrying policies by just designing the right reward function, and this is a big theoretical and empirical validation.

11:26 AM · Jun 10, 2026 · 2.6K Views

davidad 🎇 @davidad

davidad 🎇@davidad

https://arxiv.org/abs/2605.31524
