
New Deterministic Algorithm Achieves Optimal Multicalibration and Omniprediction


Original post

For a long time we didn't know if test-time randomization was needed for sample-optimal multicalibration and omniprediction. It's not. https://arxiv.org/abs/2606.20557

5:09 AM · Jun 19, 2026 · 1.1K Views


Posts from X


Aaron Roth (@Aaroth)

The best algorithms for multicalibration/omniprediction were for the online adversarial setting, where randomness really is needed. They could be applied to the standard distributional setting using an "online to batch reduction" but that introduced even more randomness.


Aaron Roth (@Aaroth)

You could try to derandomize these algorithms by fixing their randomness, but here's an obstruction. The distribution is uniform over (red, 1) and (blue, 0). The predictor predicts f(red) = 2/3 and f(blue) = 1/3 w.p. 2/3, and f(red) = 1/3 and f(blue) = 2/3 w.p. 1/3.


Aaron Roth (@Aaroth)

But what if your online learner had a hint: Every day t, it received an interval [a_t,b_t] and the promise that the true mean was in the interval. Now it can learn optimally and randomize only within the interval. But where can you get a hint from? Online it would be impossible.


Aaron Roth (@Aaroth)

But if you are running an online to batch reduction, you can get valid hints by forming confidence intervals around conditional means for points you see frequently in your training sample. The width of the interval hint now scales naturally with the frequency of the point.
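As a rough illustration of how such hints could be formed, here is a minimal sketch that builds an interval around the empirical conditional mean of each point in a training sample. The choice of a Hoeffding bound, the per-point failure probability `delta` (no union bound over points), and the names `hoeffding_hints` and `sample` are all assumptions made for this example; the paper may construct its intervals differently.

```python
import math
from collections import defaultdict

def hoeffding_hints(sample, delta=0.05):
    """Map each x to an interval (a, b) around its empirical label mean.

    sample: iterable of (x, y) pairs with y in [0, 1].
    Assumption: a two-sided Hoeffding bound with per-point failure
    probability delta; this is an illustrative choice, not the paper's.
    """
    labels = defaultdict(list)
    for x, y in sample:
        labels[x].append(y)

    hints = {}
    for x, ys in labels.items():
        n = len(ys)
        mean = sum(ys) / n
        # The half-width shrinks like 1/sqrt(n): frequent points get tight hints.
        half_width = math.sqrt(math.log(2 / delta) / (2 * n))
        hints[x] = (max(0.0, mean - half_width), min(1.0, mean + half_width))
    return hints

# Hypothetical sample: two frequent points and one rare point.
sample = (
    [("red", 1)] * 400
    + [("blue", 0)] * 400
    + [("green", 1)] * 2
    + [("green", 0)] * 2
)
hints = hoeffding_hints(sample)
```

In this toy sample the frequent points get narrow intervals, while the rare point gets an interval covering all of [0, 1], which is the frequency-dependent behavior the post describes.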


Aaron Roth (@Aaroth)

This randomized red/blue predictor is perfectly calibrated, but no deterministic predictor with range {1/3, 2/3} can be calibrated here, so fixing randomness fails. If you think about it for a bit, the problem is the combination of high variance of prediction and high weight in the distribution.
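The red/blue claim can be checked by brute force. The sketch below uses exact rational arithmetic and one standard calibration measure, the sum over predicted values v of |E[(y - v) · 1{f(x) = v}]|; that choice of measure and the helper name `calibration_error` are assumptions made for this example.

```python
from fractions import Fraction
from itertools import product

# Distribution from the thread: uniform over (red, 1) and (blue, 0).
DATA = {"red": 1, "blue": 0}
RANGE = (Fraction(1, 3), Fraction(2, 3))

def calibration_error(mixture):
    """Calibration error of a (possibly randomized) predictor.

    mixture: list of (probability, predictor) pairs, where each predictor
    maps a color to a prediction. Returns the sum over predicted values v
    of |E[(y - v) * 1{f(x) = v}]|, which is zero iff perfectly calibrated.
    """
    bias = {}
    for weight, f in mixture:
        for x, y in DATA.items():
            v = f[x]
            # Each point has probability 1/2 under the uniform distribution.
            bias[v] = bias.get(v, 0) + weight * Fraction(1, 2) * (y - v)
    return sum(abs(b) for b in bias.values())

# The randomized predictor described in the thread.
randomized = [
    (Fraction(2, 3), {"red": Fraction(2, 3), "blue": Fraction(1, 3)}),
    (Fraction(1, 3), {"red": Fraction(1, 3), "blue": Fraction(2, 3)}),
]
randomized_error = calibration_error(randomized)

# All four deterministic predictors with range {1/3, 2/3}.
deterministic_errors = {
    (r, b): calibration_error([(Fraction(1), {"red": r, "blue": b})])
    for r, b in product(RANGE, repeat=2)
}
```

Here `randomized_error` comes out to exactly 0, while every entry of `deterministic_errors` is strictly positive (the smallest is 1/6, attained by the two constant predictors), so no way of fixing the randomness yields a calibrated predictor.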


Aaron Roth (@Aaroth)

So do this. You get a randomized predictor whose feature-conditional variance scales smoothly with the frequency of x. And once you have this, you have avoided the obstruction in the red/blue example, and fixing the randomness works!
