Skip to main content

Alibaba International Unveils Ovis2.5, Advancing AI Visual and Reasoning Capabilities

Alibaba International Unveils Next-Gen AI Model Ovis2.5

Alibaba International has officially released Ovis2.5, its latest multimodal large model, now available as open-source. This next-generation AI focuses on native resolution visual perception, deep reasoning, and cost-effective scenario design, aiming to push the boundaries of artificial intelligence applications.

Image

Performance and Versions

The model has achieved a comprehensive score of 78.3 on the mainstream multimodal evaluation suite OpenCompass, outperforming many larger models and securing the top spot among open-source models with fewer than 40 billion parameters.

Ovis2.5 comes in two versions:

  • Ovis2.5-9B: Optimized for high-performance applications, scoring 78.3 on OpenCompass.
  • Ovis2.5-2B: Designed for edge-side and resource-constrained environments, scoring 73.9 while maintaining efficiency.

Architectural Innovations

The development team implemented systematic upgrades across three key areas:

  1. Model Architecture: Retains the series' structured embedding alignment design, featuring dynamic resolution visual feature extraction and enhanced language processing via Qwen3.
  2. Training Strategy: Employs a five-stage training plan including visual pre-training and large-scale instruction fine-tuning, with algorithms like DPO and GRPO to boost reasoning capabilities.
  3. Data Engineering: Increased training data by 50%, with emphasis on visual reasoning, charts, OCR, and Grounding tasks.

Availability and Applications

The code and models are now accessible on platforms including GitHub and Hugging Face, enabling developers worldwide to explore its potential across various AI applications.

Key Points:

  • 🚀 SOTA Performance: Scores 78.3 on OpenCompass, leading open-source models under 40B parameters.
  • ⚙️ Dual Versions: Ovis2.5-9B for high-power needs; Ovis2.5-2B for edge computing.
  • 📈 Enhanced Training: Five-stage strategy with preference alignment algorithms improves reasoning.
  • 🔍 Focus Areas: Expanded data targets visual reasoning, OCR, and structural understanding.

Enjoyed this article?

Subscribe to our newsletter for the latest AI news, product reviews, and project recommendations delivered to your inbox weekly.

Weekly digestFree foreverUnsubscribe anytime

Related Articles

News

Xiaohongshu Shakes Up AI World by Open-Sourcing Its Relax Training Engine

In a surprising move, lifestyle platform Xiaohongshu has open-sourced its AI training engine called Relax, designed for multi-modal scenarios. This sophisticated tool handles text, images, audio and video through innovative parallel processing. The unexpected contribution from a non-traditional AI player signals the company's serious ambitions in artificial intelligence development and its desire to build influence in the tech community.

April 15, 2026
AIOpen SourceMachine Learning
OpenAI's New Toolkit Makes AI Assistants Safer for Businesses
News

OpenAI's New Toolkit Makes AI Assistants Safer for Businesses

OpenAI has rolled out significant upgrades to its Agents SDK, giving developers better tools to create secure AI assistants. The standout feature is a sandbox environment that prevents unpredictable AI behavior from causing system-wide issues. Businesses can now test AI agents more safely while leveraging OpenAI's models. The update also introduces an integrated framework for smoother development, with Python support available now and TypeScript coming soon.

April 16, 2026
OpenAIAI DevelopmentEnterprise Technology
Alibaba's 'Happy Oyster' Lets You Build 3D Worlds in Real-Time
News

Alibaba's 'Happy Oyster' Lets You Build 3D Worlds in Real-Time

Alibaba's ATH Group has opened beta testing for its innovative 'Happy Oyster' platform, a real-time 3D world creation tool that responds to user inputs as they work. The platform offers two creative modes - director and explorer - allowing users to shape dynamic environments for gaming, film, and other applications. Early adopters can now apply for access through the official website.

April 16, 2026
Alibaba3D modelingAI creativity
HarmonyGNN: A Breakthrough in AI's Understanding of Complex Relationships
News

HarmonyGNN: A Breakthrough in AI's Understanding of Complex Relationships

A new AI training method called HarmonyGNN is revolutionizing how computers understand complex relationships in data. Developed by researchers at North Carolina State University, this technique helps neural networks better distinguish between different types of connections in graph data, achieving accuracy improvements up to 9.6%. The innovation could have significant implications for fields like drug discovery and weather forecasting.

April 14, 2026
Artificial IntelligenceMachine LearningGraph Neural Networks
Xiaomi's AI Model Joins Leading Open-Source Framework with Free Trial
News

Xiaomi's AI Model Joins Leading Open-Source Framework with Free Trial

Xiaomi has integrated its MiMo-V2 AI model series into the Hermes Agent framework, a major player in open-source AI development. Developers can now access Xiaomi's Pro, Omni, and Flash models for free for two weeks. This partnership combines Xiaomi's hardware expertise with Hermes' self-evolving capabilities, offering new possibilities for AI assistants. The move signals a shift in AI competition from conversational quality to execution efficiency.

April 10, 2026
XiaomiAI DevelopmentOpen Source
News

Alibaba's HappyHorse gallops ahead in AI video race, topping ByteDance's model

A mysterious new AI model called HappyHorse-1.0 has sprinted to the front of China's text-to-video competition, scoring 1332 on the Elo rating system - nearly 60 points above ByteDance's Dreamina Seedance2.0. Industry insiders suggest the dark horse contender comes from Alibaba's Future Life Lab, now operating under the company's ATH business group. With Alibaba as its first social media follower, this breakthrough signals China's growing strength in sophisticated video generation technology.

April 10, 2026
AI video generationAlibabaHappyHorse