Over the past 90 days I dropped everything to study Zeroth-Order Optimization: forgotten techniques like Evolution Strategies, which can train models without backprop.
I also wrote multiple codebases: ZOTitan for training, a kernel autoresearch harness, and kernels.
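
For a sense of what "training without backprop" means, here is a minimal sketch of a textbook antithetic Evolution Strategies update on a toy problem. It is not taken from ZOTitan or any of the codebases above. The names `es_minimize` and `quadratic`, the 2-parameter target, and every hyperparameter are assumptions picked for illustration. The optimizer only ever calls the loss function; it never asks for a derivative.

```python
import random


def es_minimize(loss, theta, steps=300, pop=20, sigma=0.1, lr=0.05, seed=0):
    """Minimize `loss` using only function evaluations (no gradients).

    Hypothetical helper for illustration. Each step samples `pop` Gaussian
    directions, evaluates the loss at theta + sigma*eps and theta - sigma*eps
    (antithetic pairs), and averages the finite-difference slopes into a
    search direction.
    """
    rng = random.Random(seed)
    theta = list(theta)
    n = len(theta)
    for _ in range(steps):
        grad_est = [0.0] * n
        for _ in range(pop):
            eps = [rng.gauss(0.0, 1.0) for _ in range(n)]
            plus = loss([t + sigma * e for t, e in zip(theta, eps)])
            minus = loss([t - sigma * e for t, e in zip(theta, eps)])
            # Slope of the loss along eps, from two forward evaluations.
            slope = (plus - minus) / (2.0 * sigma)
            for i in range(n):
                grad_est[i] += slope * eps[i] / pop
        # Plain gradient-descent step on the estimated direction.
        theta = [t - lr * g for t, g in zip(theta, grad_est)]
    return theta


def quadratic(x):
    """Toy loss (assumed for the demo): squared distance to a fixed target."""
    target = [3.0, -2.0]
    return sum((xi - ti) ** 2 for xi, ti in zip(x, target))


theta = es_minimize(quadratic, [0.0, 0.0])
print(theta, quadratic(theta))
```

Each step costs `2 * pop` forward evaluations and no backward pass, which is the trade that zeroth-order methods make: more forward compute in exchange for not storing activations or computing gradients.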


