
We propose to densify the supervision with stochastic injections - pre-train noise→target, then fine-tune onto source→target - jitter source samples with Gaussian noise - perturb the interpolant path
All three are training-time only, basically computationally free, no architecture changes, and you can sample with a plain ODE (no expensive SDE solver).
Beats vanilla flow by 13 FID & baselines by 9, across diverse scientific domains: 🔬🌄🛰️🩻🌌.