Perplexity AI releases pplx-embed-v1-late-0.6b, a 0.6-billion-parameter late-interaction embedding model, on Hugging Face with per-token MaxSim optimization and multilingual support
——0——
Companion kernel delivers 3-5x speedup on Metal and CUDA.
QUOTE POST
#160Omar Khattab@LATEINTERACTION
oh! cool to see @perplexity_ai train late interaction (colbert) models
okay maybe it's a good time? We have a small colbert model trained at pplx, it is a continue-training of pplx-embed-0.6b, so native multilingual, just made it open and added a section how to use MaxSim kernel: https://huggingface.co/perplexity-ai/pplx-embed-v1-late-0.6b
5:07 PM · May 18, 2026 · 20.2K Views
8:12 PM · May 18, 2026 · 5K Views