Hardware Designs Must Prioritize Latency Reduction For AI Inference From Start · Digg