Skip to main content

Alibaba Cloud Introduces Major Price Cuts for AI Models

Alibaba Cloud Announces Significant Price Reductions for Qwen-VL Models

Alibaba Cloud has made headlines once again with a substantial price cut for its Qwen-VL series large models. This announcement marks the third round of price adjustments for the year, following reductions in May and September. The new pricing structure offers a remarkable decrease, with the Tongyi Qianwen series visual understanding models experiencing an overall reduction of over 80%.

The Qwen-VL-Plus model now costs only 0.0015 yuan per thousand tokens, a staggering 81% drop, while the higher-performance Qwen-VL-Max has been reduced to 0.003 yuan per thousand tokens, representing an 85% decrease. Under the new pricing model, users can process approximately 600 images at 720P or 1700 images at 480P for just 1 yuan.

image

Features of the Qwen-VL Series

The Qwen-VL series consists of multi-modal models that have gained popularity within the open-source community due to their robust visual reasoning capabilities. These models are designed to recognize images of various resolutions and aspect ratios, and they can comprehend extended videos exceeding 20 minutes. The versatility of Qwen-VL allows it to understand tasks performed by intelligent agents, such as mobile devices and robots, making it applicable across a wide range of visual recognition scenarios, including smartphones and automobiles.

image

Reasons Behind the Price Reduction

According to the Alibaba Cloud Bailian team, the recent price cuts are attributed to ongoing optimization of both the cloud infrastructure and model architecture. Additionally, the exponential growth in model usage has facilitated economies of scale. Continuous technological advancements and optimizations have also led to significant improvements in inference efficiency.

The introduction of an elastic AI computing power scheduling system, in tandem with the Bailian distributed inference acceleration engine, has effectively reduced model inference costs and accelerated inference times. Alibaba Cloud has noted that as the visual understanding capabilities of Qwen-VL improve, the model has emerged as one of the fastest-growing options on the Bailian platform.

New KV Cache Billing Model

To enhance cost efficiency for users utilizing the large model API, Alibaba Cloud Bailian has rolled out a new KV Cache billing model. This innovative model significantly lowers invocation costs by automatically caching context, thereby avoiding redundant computations. The KV Cache model is particularly advantageous for applications involving long texts, code completion, multi-turn dialogues, and specific text summarization.

Implications for AI Accessibility

With these price reductions, Alibaba Cloud is not only making AI technology more accessible but is also opening up new opportunities for developers and enterprises. By continuously enhancing performance and reducing usage costs, Alibaba Cloud is driving the widespread adoption and application of AI technology, which provides robust technical support for the digital transformation across various industries.

Key Points

  1. Alibaba Cloud has reduced prices for its Qwen-VL series models by over 80%.
  2. Users can now process 600 images for just 1 yuan.
  3. The price reduction is attributed to infrastructure optimizations and increased model usage.
  4. A new KV Cache billing model has been introduced to further reduce costs for users.
  5. These changes promote greater accessibility to AI technologies for developers and businesses.

Enjoyed this article?

Subscribe to our newsletter for the latest AI news, product reviews, and project recommendations delivered to your inbox weekly.

Weekly digestFree foreverUnsubscribe anytime

Related Articles

Alibaba's Homegrown AI Chip Takes on NVIDIA with Zhenwu 810E Launch
News

Alibaba's Homegrown AI Chip Takes on NVIDIA with Zhenwu 810E Launch

Alibaba's semiconductor arm Tengtouge has unveiled its powerful new AI processor, the Zhenwu 810E, marking a significant milestone in China's chip development. This fully self-developed chip outperforms NVIDIA's A800 and rivals the H20 in key metrics, while boasting impressive 700GB/s inter-chip bandwidth. Already deployed across Alibaba Cloud's infrastructure, the chip promises seamless integration with existing AI ecosystems.

January 29, 2026
AI ChipsSemiconductorsAlibaba Cloud
Xiaomi's AI Model Goes Paid: What You Need to Know
News

Xiaomi's AI Model Goes Paid: What You Need to Know

Xiaomi has introduced payment options for its MiMo-V2-Flash AI model while offering free quotas to users. The tech giant revealed pricing details for both domestic and international markets, with the model boasting impressive 309 billion parameters. While requiring real-name verification in China, overseas users enjoy simpler payment methods through Apple Pay and credit cards.

January 21, 2026
XiaomiAI ModelsTech News
News

Alibaba Cloud's New Kit Brings AI Smarts to Everyday Gadgets

Alibaba Cloud has unveiled a game-changing development kit that packages its powerful AI models into ready-to-use tools for hardware makers. The kit combines speech, vision, and language capabilities to help devices like smart glasses and robots understand and interact with users naturally. With pre-built features ranging from homework help to creative tools, manufacturers can now add human-like intelligence to their products in weeks rather than months.

January 8, 2026
Alibaba CloudAI hardwaresmart devices
Xiaomi Gives Users Extra Time With Its MiMo AI – Free Trial Extended
News

Xiaomi Gives Users Extra Time With Its MiMo AI – Free Trial Extended

Xiaomi is giving AI enthusiasts more time to play with its powerful MiMo-V2-Flash model. Originally set to expire at the end of December 2025, the free trial period has been extended by 20 days until January 20, 2026. This open-source model packs serious power with 309 billion parameters and shines in reasoning and coding tasks. While keeping access free for now, Xiaomi is preparing to roll out payment options soon.

December 31, 2025
XiaomiAI ModelsTech News
News

Tongyi Qianwen's Qwen Code Evolves Into Developer Powerhouse

Alibaba Cloud's Tongyi Lab has unveiled Qwen Code v0.5.0, transforming the AI coding assistant from a simple command-line tool into a comprehensive development ecosystem. The upgrade brings smarter IDE integration, better project understanding, and new collaboration features tailored for Chinese developers working with local tech stacks like Spring Boot and HarmonyOS.

December 26, 2025
AI ProgrammingDeveloper ToolsChinese Tech
News

Alibaba Cloud Joins Forces with AiShi Tech to Supercharge AI Video Globally

In a strategic move that could reshape the AI video landscape, AiShi Technology has partnered with Alibaba Cloud to accelerate global expansion. The deal will see Alibaba provide full-stack AI support for AiShi's popular PixVerse platform, which already boasts over 100 million users worldwide. This collaboration comes as PixVerse claims top rankings in global AI video benchmarks, signaling China's growing influence in generative media technologies.

December 17, 2025
AI video generationAlibaba CloudPixVerse