Smola Outlines LLM Inference Efficiency Across Hardware and Models · Digg