Runway Adopts DTensor to Prevent Silent Gradient Bugs in Distributed Training
——0——
Sentiment
Pos33.3%
Neg66.7%
Some users praised a writeup on Runway adopting DTensor to prevent silent gradient bugs, while others called debugging it in PyTorch pretty painful and noted wasting time compared to easier sharding in JAX.