/Tech2h ago

Optimistic Online Algorithms Speed Convergence For Smooth Saddle Point Problems

414012.6K

Original post

However, one can do better if the problem is smooth (i.e., the gradient is Lipschitz). In this case, using two optimistic online learning algorithms, the convergence will be faster. Moreover, if you use optimistic online gradient descent/ascent, the last iterate will converge

Francesco Orabona@bremen79

If the problem is convex in the first variable and concave in the second one, we can use two online learning algorithms playing against each other. It is easy to show (e.g., see my online learning book) that the average of the iterates will converge to the saddle point.

8:06 AM · Jun 15, 2026 · 852 Views

Sentiment

Users are excited about ChatGPT 5.5 Pro completing a highly non-trivial proof by building on partial results from research on optimistic online algorithms for saddle point problems.

Pos

100.0%

Neg

0.0%

1 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

VIEWS594LIKES4REPLIES1

Francesco Orabona@bremen79

This is interesting because it means that the actual plays of the two players will converge to the optimal play, rather than the average of their plays. Now, what happens if we use optimistic multiplicative weights updates?

Francesco Orabona@bremen79

2h59440

Francesco Orabona@bremen79

One can still show that the convergence will be fast, yet it was unknown if the last iterate would converge too. People proved all sorts of partial results on this problem, but a general answer was still missing. I also tried to solve it on and off in the past few years.

Francesco Orabona@bremen79

2h58230

Francesco Orabona@bremen79

The difficulty is that the entropy function of the multiplicative updates is not differentiable on the boundary of the simplex, so certain limit operations fail

Intuitively, the Bregman is infinitely steep on the boundary, hence the iterates "slow down" even without convergence

Francesco Orabona@bremen79

2h57020

Francesco Orabona@bremen79

So, I tried ChatGPT.

But ChatGPT 5.5 Pro failed, no matter what I tried.

So, I used a different strategy: I wrote a pdf with everything I knew about this problem, including results that were not immediately useful, and told ChatGPT to complete the proof. This time it worked!

2h121

Francesco Orabona@bremen79

ChatGPT 5.5 Pro found a way to use one of my partial results to complete the proof, a highly non-trivial one.

The full pdf, including my prompting strategies, is here: https://arxiv.org/pdf/2606.11773

Please let me know what you think.

@SebastienBubeck, you might also like it :)

2h28

Francesco Orabona@bremen79

This is my second LLM-assisted proof. In both cases, the LLMs failed at first.

I believe this suggests the need for specific harnesses or prompting strategies that encourage exploring ways that are farther than 1-step away.

2h181