Modal Engineer Presents Stanford Talk On Transformer Inference Deployment · Digg