Meituan Unveils LongCat-Video Model for 5-Minute AI-Generated Content
Meituan Introduces Advanced Video Generation AI
Chinese tech giant Meituan has officially released LongCat-Video, its latest artificial intelligence model specializing in video generation. This development marks a significant advancement in AI's ability to understand and reconstruct the real world through dynamic visual media.
Technical Architecture and Capabilities
The model is built on the Diffusion Transformer (DiT) architecture and handles three video generation tasks within a single model, without task-specific adaptation:
- Text-to-video generation: Produces 720p HD videos at 30fps with accurate interpretation of text prompts
- Image-to-video generation: Preserves all features of reference images while creating physically plausible animations
- Video continuation: Extends existing footage while maintaining logical consistency
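
One way a single model could cover all three tasks is to treat them as the same problem with a different number of conditioning frames: none for text-to-video, one for image-to-video, several for video continuation. The sketch below illustrates that idea only. It is an assumption for illustration, not Meituan's published interface, and `GenerationRequest` and `infer_task` are hypothetical names.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class GenerationRequest:
    """Hypothetical request object; not part of any released LongCat-Video API."""
    prompt: str
    # Frames supplied as visual conditioning (opaque placeholders in this sketch).
    condition_frames: List[str] = field(default_factory=list)


def infer_task(request: GenerationRequest) -> str:
    """Pick the task purely from how many conditioning frames were supplied."""
    n = len(request.condition_frames)
    if n == 0:
        return "text-to-video"
    if n == 1:
        return "image-to-video"
    return "video-continuation"


print(infer_task(GenerationRequest("a cat surfing")))
print(infer_task(GenerationRequest("a cat surfing", ["frame_0"])))
print(infer_task(GenerationRequest("a cat surfing", ["frame_0", "frame_1"])))
```

Under this assumption the caller never selects a mode explicitly, which is one reading of how a model can switch tasks without extra adaptation.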

Breakthrough in Long-Form Content
LongCat-Video's most notable achievement is generating continuous 5-minute videos without the color drift and quality degradation that tend to accumulate over long sequences. The model combines several techniques:
- Advanced temporal consistency algorithms
- Physical movement rationality checks
- Block sparse attention mechanisms
- Conditional token caching systems
Together, these features address the longstanding trade-off between length and quality in AI-generated video content.
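
To see why sparse attention matters here: a 5-minute clip at 30fps is 9,000 frames, and dense attention cost grows with the square of the token count. Block sparse attention, in general, groups tokens into blocks and lets each query block attend to only a few key blocks. The article does not say how LongCat-Video selects blocks, so the sketch below assumes a simple rule (keep the top-scoring key blocks by mean-pooled dot product); all function names are hypothetical.

```python
from typing import List

Vector = List[float]


def mean_pool_blocks(tokens: List[Vector], block_size: int) -> List[Vector]:
    """Average each run of `block_size` token vectors into one block summary."""
    blocks = []
    for start in range(0, len(tokens), block_size):
        chunk = tokens[start:start + block_size]
        dim = len(chunk[0])
        blocks.append([sum(v[d] for v in chunk) / len(chunk) for d in range(dim)])
    return blocks


def dot(a: Vector, b: Vector) -> float:
    return sum(x * y for x, y in zip(a, b))


def block_sparse_mask(queries: List[Vector], keys: List[Vector],
                      block_size: int, keep: int) -> List[List[bool]]:
    """mask[i][j] is True when query block i may attend to key block j."""
    q_blocks = mean_pool_blocks(queries, block_size)
    k_blocks = mean_pool_blocks(keys, block_size)
    mask = []
    for q in q_blocks:
        scores = [dot(q, k) for k in k_blocks]
        # Assumed selection rule: keep the `keep` highest-scoring key blocks.
        top = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)[:keep]
        mask.append([j in top for j in range(len(k_blocks))])
    return mask


def kept_fraction(mask: List[List[bool]]) -> float:
    """Share of block pairs that still need attention computed."""
    total = sum(len(row) for row in mask)
    kept = sum(sum(row) for row in mask)
    return kept / total


# Toy input: 8 tokens with 2-dimensional features, grouped into blocks of 2.
tokens = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0],
          [1.0, 1.0], [1.0, 1.0], [-1.0, 0.0], [-1.0, 0.0]]
mask = block_sparse_mask(tokens, tokens, block_size=2, keep=2)
print(kept_fraction(mask))  # 0.5: each of 4 query blocks keeps 2 of 4 key blocks
```

In this toy case half of the block pairs are skipped; with thousands of frames and a small fixed `keep`, the skipped share would be far larger.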
Performance Optimization
The model pairs efficient inference with strong benchmark results:
- Multiple inference speed optimization strategies
- Consistent performance across internal and public benchmarks
- Leading results in open-source video generation metrics

The release opens new possibilities for content creators by simplifying long-form video production while maintaining professional quality standards.
Availability
The model is accessible through:
Key Points:
- Innovative Architecture: Based on Diffusion Transformer technology for versatile video generation
- Multi-Task Support: Handles text-to-video, image-to-video, and video continuation without additional adaptation
- Extended Duration: Stable output of continuous 5-minute videos sets a new industry standard
- Quality Maintenance: Advanced techniques prevent color drift and quality degradation