VLA is 95% certain about current action. Will it 95% succeed in the task?
Obviously, not necessarily. But if you’re clever, you can *calibrate* action prob. to task success.
Our #ICML2026 paper formulates this + SOTA algorithms based on new connection to RL temporal differences