Resurfaced 2017 paper introduces hypergradient descent to dynamically tune learning rates during training · Digg