Gergely Orosz warns that hosting open-weight models across multiple providers introduces unpredictable latency due to silent variant routing · Digg