Looking for an early hire in SF for helping with model optimization: spec decoding, GPU kernels, pooling infra, and wants to go deep on vLLM/sglang. Hunger/interest over experience in this case. If you know anyone, DMs open. They’d work directly with me.
8:37 AM · Jun 30, 2026 · 1.8K Views