Meituan Open-Sources LongCat-Video-Avatar 1.5 for Production Digital Humans

Original post

Meet LongCat-Video-Avatar 1.5 🐱, our upgraded, open-source digital human framework. Built for real production, not just short demos.

What's New:
🔹 Upgraded Audio Encoder: Replaces Wav2Vec2 with Whisper-Large, yielding significantly smoother and more natural lip dynamics.
🔹 Production-Ready Stability: Achieves accurate lip-synchronization, full-body temporal stability, and robust long-video generation with strict identity consistency.
🔹 Stylized Domain Generalization: Robustly generalizes to anime, animals, and complex real-world conditions such as multi-person interactions and object handling.
🔹 Efficient 8-Step Inference: Advanced step distillation accelerates inference to 8 NFE, balancing cost-effective serving with exceptional visual fidelity.

📊 LongCat-Video-Avatar 1.5 performs strongly in realism, naturalness, and stability, outperforming leading open-source models and closed systems.

🐱 Avatar 1.5 framework is now open source:
🔗 Weights & Code: https://github.com/meituan-longcat/LongCat-Video
🔗 HuggingFace: https://huggingface.co/meituan-longcat/LongCat-Video-Avatar-1.5
🔗 Tech Report: https://github.com/meituan-longcat/LongCat-Video/blob/main/assets/LongCat-Video-Avatar-1.5-Tech-Report.pdf
🔗 Project Page: https://meigen-ai.github.io/LongCat-Video-Avatar-1.5-Page/

9:09 AM · May 21, 2026