Skip to main content

Xiaomi Open-Sources Multimodal AI Model MiMo-VL-7B-2508

Xiaomi Open-Sources Advanced Multimodal AI Model

Xiaomi's AI research team has publicly released its MiMo-VL-7B-2508 multimodal large language model, marking a significant contribution to the open-source AI community. The release includes both Reinforcement Learning (RL) and Supervised Fine-Tuning (SFT) versions of the model.

Breakthrough Performance Metrics

The new model demonstrates exceptional capabilities across multiple domains:

  • Subject reasoning: Achieved 70+ on MMMU benchmark
  • Document understanding: Scored 94.4 on ChartQA
  • Graphical interface positioning: Reached 92.5 on ScreenSpot-v2
  • Video understanding: Improved to 70.8 on VideoMME

Image

Technical Enhancements

The latest iteration shows substantial improvements in:

  1. Reinforcement learning stability
  2. Supervised fine-tuning processes
  3. Internal VLM Arena score (increased from 1093.9 to 1131.2)

User-Centric Features

The model introduces innovative interaction modes:

  • Thinking mode: Displays complete reasoning chains (100% control success rate)
  • Non-thinking mode: Direct answer generation (99.84% success rate with faster responses) Users can toggle between modes using the /no_think instruction.

Available Model Versions

MiMo-VL-7B-RL-2508

MiMo-VL-7B-SFT-2508

Key Points

✅ New state-of-the-art performance in four core AI capabilities
✅ Dual-mode operation optimizes for accuracy or speed
✅ Fully open-sourced with commercial-friendly licensing
✅ Enhanced stability for reinforcement learning applications

Enjoyed this article?

Subscribe to our newsletter for the latest AI news, product reviews, and project recommendations delivered to your inbox weekly.

Weekly digestFree foreverUnsubscribe anytime

Related Articles

Google Goes All-In: Gemma 4 AI Models Hit Open Source with Apache 2.0 License
News

Google Goes All-In: Gemma 4 AI Models Hit Open Source with Apache 2.0 License

Google DeepMind just dropped a bombshell in the AI world - their powerful Gemma 4 models are now fully open source under Apache 2.0. This means developers can freely use and modify these cutting-edge AI tools for any purpose, including commercial projects. The lineup includes four specialized models, from a powerhouse 31B parameter version to compact options for mobile devices. Performance leaps are staggering, with math skills jumping from 20% to nearly 90% accuracy. Google's move signals a major shift in the open source AI landscape.

April 3, 2026
AIOpenSourceGoogleDeepMind
Xiaomi's MiMo AI Model Rolls Out Affordable Subscription Plans for Developers
News

Xiaomi's MiMo AI Model Rolls Out Affordable Subscription Plans for Developers

Xiaomi has unveiled a tiered subscription service for its MiMo large language model, offering developers flexible access starting at just 39 yuan per month. The four-tier system covers all modalities from text to audio processing, with the flagship MiMo-V2-Pro ranking among the world's top AI models. This move signals Xiaomi's push into commercial AI services while simplifying cost management for developers.

April 3, 2026
XiaomiAI modelstech subscriptions
News

Xiaomi's MiMo AI Model Goes Paid: Token Plans Start at 39 Yuan

Xiaomi has rolled out its first paid subscription plans for the MiMo large language model, offering four pricing tiers from 39 to 659 yuan per month. The plans give developers and AI enthusiasts access to three core models, marking Xiaomi's shift from free beta testing to monetizing its AI ecosystem. This move reflects the industry's broader transition towards sustainable AI business models.

April 3, 2026
XiaomiAI ModelsTech Subscriptions
Google's Gemma4 AI Model Goes Open-Source with Impressive Capabilities
News

Google's Gemma4 AI Model Goes Open-Source with Impressive Capabilities

Google has unveiled Gemma4, its latest open-source AI model series featuring four variants with groundbreaking capabilities. The lineup includes efficient E2B and E4B models for edge devices and powerful 26B MoE and 31B dense versions that rank among the world's top open-source models. What makes Gemma4 special? It supports images, videos, and even real-time voice processing while being remarkably accessible for local deployment.

April 3, 2026
Gemma4OpenSourceAIGoogleAI
News

Anthropic's GitHub Cleanup Backfires, Wiping Thousands of Legit Repos

In a dramatic case of overzealous damage control, AI company Anthropic accidentally deleted thousands of legitimate GitHub repositories while trying to remove leaked source code. What began as an effort to contain a security breach turned into a PR disaster when automated tools misfired, wiping out unrelated projects. The incident has sparked outrage among developers and raised questions about how tech giants handle crisis management in the open-source community.

April 2, 2026
AnthropicGitHubOpenSource
Gaode's ABot-M0 Gives Robots a Universal Brain
News

Gaode's ABot-M0 Gives Robots a Universal Brain

In a major leap for robotics, Gaode has open-sourced ABot-M0, the world's first unified architecture for robot intelligence. This 'universal brain' outperforms previous models by 30% on key benchmarks, while its complete open-source package—including algorithms and training data—could revolutionize how we develop smart robots for homes and industries.

April 1, 2026
roboticsAIopen-source