Researcher Prefers Discrete Time and Convex Optimization Over Continuous Stochastic Processes

REPLY

this is what motivated me to start thinking about measure transport as a discrete-time sequential decision-making problem: take an input sample X_0 and perform a sequence of transformations to it to obtain X_H 5/

Gergely Neu@neu_rips

i admire people that can deal with that sort of math, but unfortunately i'm not one of them... i'm a computer scientist and i can only deal with discrete time! also, i prefer convex analysis & optimization as the basis of my algorithms 4/

10:40 AM · May 22, 2026 · 394 Views

10:40 AM · May 22, 2026 · 379 Views

REPLY

#602Gergely Neu@NEU_RIPS

this looks like an RL problem, except that we want to impose the hard constraint that the final state should match the the desired target distribution. this makes our setting a "constrained markov decision process", which can be really hard to solve in general 6/

Gergely Neu@neu_rips

this is what motivated me to start thinking about measure transport as a discrete-time sequential decision-making problem: take an input sample X_0 and perform a sequence of transformations to it to obtain X_H 5/

10:40 AM · May 22, 2026 · 379 Views

10:40 AM · May 22, 2026 · 366 Views

REPLY

#602Gergely Neu@NEU_RIPS

we use this insight to develop a new family of generative models that generate samples by performing gradient descent (in sample space!) on a sequence of learned value functions 8/

Gergely Neu@neu_rips

luckily, MDP / optimal control theory still helps us to figure out the structure of the solution. turns out, the optimal policy comes in the form of a "value-driven transport" policy that is fully characterized by the analogue of "optimal value functions" from MDP theory 7/

10:40 AM · May 22, 2026 · 374 Views

10:40 AM · May 22, 2026 · 706 Views

REPLY

#602Gergely Neu@NEU_RIPS

luckily, MDP / optimal control theory still helps us to figure out the structure of the solution. turns out, the optimal policy comes in the form of a "value-driven transport" policy that is fully characterized by the analogue of "optimal value functions" from MDP theory 7/

Gergely Neu@neu_rips

this looks like an RL problem, except that we want to impose the hard constraint that the final state should match the the desired target distribution. this makes our setting a "constrained markov decision process", which can be really hard to solve in general 6/

10:40 AM · May 22, 2026 · 366 Views

10:40 AM · May 22, 2026 · 374 Views

REPLY

#602Gergely Neu@NEU_RIPS

how do we compute good value functions then? for this, we take inspiration (once again) from classic results in optimal control that characterize the optimal solution in terms of a linear program (LP) in the space of state distributions generated by the optimal policy 9/

Gergely Neu@neu_rips

we use this insight to develop a new family of generative models that generate samples by performing gradient descent (in sample space!) on a sequence of learned value functions 8/

10:40 AM · May 22, 2026 · 706 Views

10:40 AM · May 22, 2026 · 365 Views

REPLY

#602Gergely Neu@NEU_RIPS

after some gymnastics, one can show that the optimal value function is exactly the set of optimal dual variables for this LP --- so they can be computed by unconstrained stochastic optimization of the dual function! 10/

Gergely Neu@neu_rips

how do we compute good value functions then? for this, we take inspiration (once again) from classic results in optimal control that characterize the optimal solution in terms of a linear program (LP) in the space of state distributions generated by the optimal policy 9/

10:40 AM · May 22, 2026 · 365 Views

10:40 AM · May 22, 2026 · 319 Views

REPLY

#602Gergely Neu@NEU_RIPS

and indeed stochastic gradients can be computed easily by exploiting the structure of the LP and the associated lagrangian 11/

Gergely Neu@neu_rips

after some gymnastics, one can show that the optimal value function is exactly the set of optimal dual variables for this LP --- so they can be computed by unconstrained stochastic optimization of the dual function! 10/

10:40 AM · May 22, 2026 · 319 Views

10:40 AM · May 22, 2026 · 322 Views

REPLY

#602Gergely Neu@NEU_RIPS

putting things together, we obtain a conceptually straightforward* primal-dual training method for computing value functions (* conceptually straightforward \neq simple. there are many fine details, but the method is not very hard to implement at the end) 12/

Gergely Neu@neu_rips

and indeed stochastic gradients can be computed easily by exploiting the structure of the LP and the associated lagrangian 11/

10:40 AM · May 22, 2026 · 322 Views

10:40 AM · May 22, 2026 · 303 Views

REPLY

#602Gergely Neu@NEU_RIPS

we coded it up and managed to make it work without any nasty tricks! on small-scale examples, it performs competitively with state-of-the-art methods like conditional flow matching, diffusion schrodinger bridge matching, etc. 13/

Gergely Neu@neu_rips

putting things together, we obtain a conceptually straightforward* primal-dual training method for computing value functions (* conceptually straightforward \neq simple. there are many fine details, but the method is not very hard to implement at the end) 12/

10:40 AM · May 22, 2026 · 303 Views

10:40 AM · May 22, 2026 · 324 Views

REPLY

#602Gergely Neu@NEU_RIPS

we also have some promising results on image data, although admittedly only on simpler data sets. (the method also supports fancy extensions such as few-step generation, image-to-image translation, classifier-free guidance, as shown in these images) 14/

Gergely Neu@neu_rips

we coded it up and managed to make it work without any nasty tricks! on small-scale examples, it performs competitively with state-of-the-art methods like conditional flow matching, diffusion schrodinger bridge matching, etc. 13/

10:40 AM · May 22, 2026 · 324 Views

10:40 AM · May 22, 2026 · 278 Views

REPLY

#602Gergely Neu@NEU_RIPS

for larger data sets, we still need some more tuning effort :D here's an exclusive sneak peak at some samples of the horrifying images we managed to generate. obviously not there yet, but it's a good indication that the method might work well at larger scales 15/

Gergely Neu@neu_rips

we also have some promising results on image data, although admittedly only on simpler data sets. (the method also supports fancy extensions such as few-step generation, image-to-image translation, classifier-free guidance, as shown in these images) 14/

10:40 AM · May 22, 2026 · 278 Views

10:40 AM · May 22, 2026 · 271 Views

REPLY

#602Gergely Neu@NEU_RIPS

what i particularly liked about the solution is that it does not require a direct parametrization of the control drift, but works directly with the value-driven structure of the optimal policy. this might have some serious practical advantages 16/

Gergely Neu@neu_rips

for larger data sets, we still need some more tuning effort :D here's an exclusive sneak peak at some samples of the horrifying images we managed to generate. obviously not there yet, but it's a good indication that the method might work well at larger scales 15/

10:40 AM · May 22, 2026 · 271 Views

10:40 AM · May 22, 2026 · 280 Views

REPLY

#602Gergely Neu@NEU_RIPS

i could go on all day, as i've been quite obsessed with these ideas over the last couple of months. none of us has worked on anything like this before, but we enjoyed the learning process greatly, and we hope that other people might find the framework interesting 17/

Gergely Neu@neu_rips

what i particularly liked about the solution is that it does not require a direct parametrization of the control drift, but works directly with the value-driven structure of the optimal policy. this might have some serious practical advantages 16/

10:40 AM · May 22, 2026 · 280 Views

10:40 AM · May 22, 2026 · 405 Views

REPLY

#602Gergely Neu@NEU_RIPS

the paper is now on arxiv: https://arxiv.org/abs/2605.22507 check it out and let us know what you think!!! 18/END

Gergely Neu@neu_rips

i could go on all day, as i've been quite obsessed with these ideas over the last couple of months. none of us has worked on anything like this before, but we enjoyed the learning process greatly, and we hope that other people might find the framework interesting 17/

10:40 AM · May 22, 2026 · 405 Views

10:40 AM · May 22, 2026 · 408 Views