@srush_nlp Single-sample as in a single sampled token per rollout (instead of topk logprobs)? My guess is that it's ease of implementation and lower memory requirements (e.g, Tinker only supports the single sample OPD version).
Sasha Rush@srush_nlp
@agarwl_ Why do you think there is so much focus on the single-sample version? I'm confused by the "negative" token arguments on twitter.
6:46 PM · Jun 15, 2026 · 885 Views