vLLM
BIO
A high-throughput and memory-efficient inference and serving engine for LLMs. Join http://slack.vllm.ai to discuss together with the community!
Eric Jang
@ericjang11
Marco Mascorro
@Mascobot
Researching LLMs | Roboticist | Prev: Partner @a16z, cofounder @Fellow_AI