Meituan Open-Sources LongCat-Video-Avatar 1.5 for Production Digital Humans

Original post

Meet LongCat-Video-Avatar 1.5 🐱: our upgraded, open-source digital human framework. Built for real production, not just short demos.

What's New:
🔹 Upgraded Audio Encoder: Replaces Wav2Vec2 with Whisper-Large, yielding significantly smoother and more natural lip dynamics.
🔹 Production-Ready Stability: Achieves accurate lip-synchronization, full-body temporal stability, and robust long-video generation with strict identity consistency.
🔹 Stylized Domain Generalization: Robustly generalizes to anime, animals, and complex real-world conditions such as multi-person interactions and object handling.
🔹 Efficient 8-Step Inference: Advanced step distillation accelerates inference to 8 NFE, balancing cost-effective serving with exceptional visual fidelity.

📊 LongCat-Video-Avatar 1.5 performs strongly in realism, naturalness, and stability, outperforming leading open-source models and closed systems.

🐱 Avatar 1.5 framework is now open source:
🔗 Weights & Code: https://github.com/meituan-longcat/LongCat-Video
🔗 HuggingFace: https://huggingface.co/meituan-longcat/LongCat-Video-Avatar-1.5
🔗 Tech Report: https://github.com/meituan-longcat/LongCat-Video/blob/main/assets/LongCat-Video-Avatar-1.5-Tech-Report.pdf
🔗 Project Page: https://meigen-ai.github.io/LongCat-Video-Avatar-1.5-Page/

9:09 AM · May 21, 2026
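
For readers unfamiliar with the "8 NFE" figure in the post: NFE (number of function evaluations) counts forward passes of the denoising network per generation, so it is a rough proxy for serving cost. The sketch below is only an illustration of that arithmetic. The helper `count_nfe` and the 50-step baseline with classifier-free guidance are hypothetical and do not come from the post or the tech report; only the figure of 8 NFE is taken from the announcement.

```python
def count_nfe(sampling_steps: int, uses_cfg: bool = False) -> int:
    """Count denoiser forward passes (NFE) for one generation.

    Hypothetical helper for illustration. Classifier-free guidance (CFG)
    is assumed to run two passes per step (conditional and unconditional).
    """
    if sampling_steps <= 0:
        raise ValueError("sampling_steps must be positive")
    passes_per_step = 2 if uses_cfg else 1
    return sampling_steps * passes_per_step


# Figure quoted in the post: 8-step inference at 8 NFE.
DISTILLED_NFE = count_nfe(8, uses_cfg=False)

# ASSUMPTION: an undistilled sampler with 50 steps and CFG.
# This baseline is made up for the example, not a reported number.
HYPOTHETICAL_BASELINE_STEPS = 50
baseline_nfe = count_nfe(HYPOTHETICAL_BASELINE_STEPS, uses_cfg=True)

reduction = baseline_nfe / DISTILLED_NFE
print(f"baseline NFE: {baseline_nfe}, distilled NFE: {DISTILLED_NFE}")
print(f"forward passes reduced by a factor of {reduction}")
```

Under this assumed baseline the distilled model would need 12.5 times fewer forward passes per generation; the real saving depends on the sampler the authors actually compare against.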