1d ago

PyTorch caching allocator produces exact deterministic memory allocations across runs when record stream usage is disabled, removing nondeterminism in GPU handling for distributed training

Technical exchange confirms approach feasibility for reproducibility goals.

2501262

——0——

Original post

@typedfemale So it's definitely feasible to have the exact allocations from the caching allocator be deterministic run to run. This mostly boils down to not using record stream. I would check on this!

4:29 PM · May 18, 2026

#233typedfemale@TYPEDFEMALE

@ezyang interesting, ok thanks i'll give it a try

Edward Z. Yang@ezyang

11:29 PM · May 18, 2026 · 134 Views

11:30 PM · May 18, 2026 · 128 Views

PyTorch caching allocator produces exact deterministic memory allocations across runs when record stream usage is disabled, removing nondeterminism in GPU handling for distributed training

Sentiment

Cluster engagement