Skip to main content

TuSimple Unveils 'Ruyi' Image-to-Video Model and Ruyi-Mini-7B

TuSimple Unveils 'Ruyi' Image-to-Video Model and Ruyi-Mini-7B

Beijing, China — On December 17, 2024, TuSimple Future Technology Co., Ltd. officially announced the release of its first large model, Ruyi, as part of its TuSheng Video series. The company also open-sourced the Ruyi-Mini-7B version, which can be downloaded from the Hugging Face platform. Founded in 2015 and headquartered in San Diego, California, TuSimple focuses on applying AI technology across various industries, including animation, gaming, and transportation.

Features of the Ruyi Model

The Ruyi model is specifically designed to operate on consumer-grade graphics cards, providing users with detailed deployment instructions and workflows through ComfyUI, enabling quick setup and use. Its performance excels in frame consistency, motion fluidity, color representation, and composition, making it a promising tool for visual storytelling. Aiming to cater to anime and gaming enthusiasts, the model has undergone extensive training in these domains.

image

Ruyi supports multi-resolution and multi-duration video generation, capable of producing outputs ranging from 384×384 to 1024×1024 pixels, with any aspect ratio. Users can create videos of up to 120 frames or 5 seconds in length and have control over the generation of first frames and transitions between keyframes. The model also offers motion amplitude control and five types of shot control. Built on the DiT architecture, Ruyi comprises a Casual VAE module and a Diffusion Transformer, totaling approximately 7.1 billion parameters and was trained on around 200 million video clips.

Challenges and Future Improvements

Despite its advancements, Ruyi does face challenges, including issues with hand distortion, facial detail collapse in multi-person scenarios, and uncontrollable transitions. TuSimple is actively addressing these challenges to improve the model in future updates.

Looking ahead, TuSimple plans to maintain its focus on scene requirements and achieve breakthroughs in direct CUT generation. The company intends to offer two versions of the model in its next release, catering to the diverse needs of creators. By utilizing large models like Ruyi, TuSimple aims to reduce the development cycle and cost associated with creating anime and game content. The Ruyi model can already generate five seconds of footage by inputting keyframes or creating transitions between them, significantly expediting the development process.

Accessing Ruyi-Mini-7B

Developers and creators interested in exploring the Ruyi-Mini-7B model can access it via the following link:

Hugging Face Link

Key Points

  1. TuSimple launched its first large model, 'Ruyi', for image-to-video transformation.
  2. The Ruyi model is compatible with consumer-grade hardware, promoting accessibility.
  3. Future updates will address existing challenges and introduce new features to enhance performance.

Enjoyed this article?

Subscribe to our newsletter for the latest AI news, product reviews, and project recommendations delivered to your inbox weekly.

Weekly digestFree foreverUnsubscribe anytime

Related Articles

China's AI-Powered Xpeng P7+ Hits Global Roads
News

China's AI-Powered Xpeng P7+ Hits Global Roads

Xpeng Motors has begun shipping its flagship 2026 P7+ electric sedan overseas, marking China's ambitious push into global AI-driven automotive markets. Dubbed 'the world's first AI car,' the P7+ boasts unprecedented 2250TOPS computing power and offers both electric and range-extended versions priced competitively between ¥186,800-198,800. Industry watchers see this as China's strategic move to establish dominance in intelligent driving technology.

February 2, 2026
electric vehiclesAI technologyChinese automakers
News

Survivalist Tests AI Limits in Freezing Snow Challenge

Film Hurricane founder Tim Pan is pushing boundaries again with an extreme survival challenge. Starting January 23rd, he'll brave -30°C temperatures in Northeast China for 100 hours armed only with paper, pencil, and AI image recognition technology. Supplies will come solely from drawings the AI can identify—a concept sparking both excitement and skepticism online. This follows Tim's recent uninhabited island survival feat that drew over 40 million viewers.

January 22, 2026
extreme survivalAI technologywinter challenge
News

LCK Star Roamer Silences AI Doubts With Stunning 93% Win Streak

When an account named '택배기사#한 진' tore through Korea's League of Legends ranked ladder with a jaw-dropping 93% win rate, players cried foul - surely this was AI at work. The truth proved more surprising: BRO mid-laner Roamer had simply entered god-tier form. His humble response? 'About 99% luck.' The revelation highlights the staggering skill gap between pros and regular players.

January 21, 2026
League of LegendsEsportsArtificial Intelligence
News

Smart Vacuums Take Over Homes as AI Cleaning Tech Surges

Robot vacuums have evolved from clumsy gadgets to intelligent home assistants, with global shipments jumping nearly 19% in 2025. Today's models can navigate around shoes and pet messes, respond to voice commands, and even learn cleaning preferences. Market data shows consumers increasingly value these smart features over raw suction power.

January 12, 2026
smart homeAI technologyconsumer electronics
Mistral AI's Voxtral Models Now Available on Amazon SageMaker
News

Mistral AI's Voxtral Models Now Available on Amazon SageMaker

Mistral AI has introduced its innovative Voxtral models, combining text and audio processing in powerful new ways. The smaller Voxtral-Mini handles quick transcriptions, while the robust Voxtral-Small tackles complex multilingual tasks. Amazon SageMaker now supports these models through flexible container deployment, opening doors for businesses to implement advanced audio-text intelligence solutions.

December 23, 2025
AI technologyVoice recognitionCloud computing
NVIDIA's NitroGen AI Learns Gaming Skills by Watching 40,000 Hours of YouTube
News

NVIDIA's NitroGen AI Learns Gaming Skills by Watching 40,000 Hours of YouTube

NVIDIA has unveiled NitroGen, a groundbreaking AI that mastered gaming by analyzing thousands of hours of player videos. Unlike specialized bots, this general-purpose agent can adapt to various game genres with surprising skill. The secret? Studying real players' controller inputs from YouTube and Twitch streams. Researchers say it performs 52% better than traditional AI when facing unfamiliar games. NVIDIA has open-sourced the project to accelerate development of versatile virtual agents.

December 22, 2025
AIGamingMachine Learning