Skip to main content

Alibaba Cloud Tightens API Access for BaiLian Platform

Alibaba Cloud Adjusts API Rate Limits on BaiLian Platform

In a move to better manage platform resources, Alibaba Cloud announced significant changes to its API rate limiting policy for the BaiLian large model service platform. The new measures take effect April 28, 2026.

What's Changing

The multimodal interaction gateway will now have a default limit of 10 queries per second (QPS). According to Alibaba Cloud's calculations, this allocation should support:

  • 600 sessions per minute
  • 36,000 sessions per hour

The company maintains this quota will satisfy most development and routine business operation needs.

Who's Affected

Developers working with BaiLian's APIs should review their current usage patterns before the changes take effect. However, there's an important exception:

"Customers who previously upgraded their quotas through official channels won't see any changes to their existing permissions," an Alibaba Cloud spokesperson confirmed.

Why Now?

The adjustment reflects growing demand for large model services across Alibaba Cloud's customer base. By implementing more granular traffic controls, the company aims to:

  1. Ensure stable service for all users
  2. Better balance resources between individual developers and enterprise clients
  3. Maintain platform performance during peak usage periods

Key Points:

  • New rate limits take effect April 28, 2026
  • Default QPS set at 10 (600 sessions/minute)
  • Existing upgraded quotas remain unchanged
  • Developers should assess current usage patterns

Enjoyed this article?

Subscribe to our newsletter for the latest AI news, product reviews, and project recommendations delivered to your inbox weekly.

Weekly digestFree foreverUnsubscribe anytime

Related Articles

News

Alibaba's Tongyi Lab unveils Fun-ASR 1.5 - a speech recognition model that masters 30 languages and even ancient poetry

Alibaba's Tongyi Lab has launched Fun-ASR1.5, a groundbreaking speech recognition model that understands 30 global languages, seven Chinese dialects, and even complex ancient poetry recitations. Now available on Alibaba Cloud's BaiLian platform, this technology promises to transform industries from education to finance with its remarkable accuracy across diverse linguistic contexts.

April 20, 2026
speech recognitionAI translationAlibaba Cloud
Alibaba's Qwen3.6-Max-Preview: A Programming Powerhouse Emerges
News

Alibaba's Qwen3.6-Max-Preview: A Programming Powerhouse Emerges

Alibaba has unveiled its latest AI model, Qwen3.6-Max-Preview, setting new standards in programming intelligence. This preview version outperforms its predecessor across multiple benchmarks, particularly in agent programming and world knowledge. While still in development, it's already showing promise as a game-changer for developers seeking advanced AI coding assistance.

April 20, 2026
AI ProgrammingAlibaba CloudQwen Series
News

Alibaba Cloud Tweaks API Limits for Smoother AI Development

Alibaba Cloud is adjusting rate limits for its BaiLian multimodal development kit, setting new default thresholds to improve service stability. Starting April 28, 2026, developers will get 10 requests per second - enough for most testing and business needs. Existing customers with custom agreements won't be affected by these changes.

April 20, 2026
Alibaba CloudAPI DevelopmentAI Tools
Google Bets Big on Custom AI Chips in Partnership With Marvell
News

Google Bets Big on Custom AI Chips in Partnership With Marvell

Google is doubling down on its AI hardware ambitions by teaming up with Marvell Technology to develop two specialized chips. The collaboration aims to create a memory processing unit to complement Google's TPUs and a next-generation TPU itself. This move could help Google reduce its dependence on Nvidia's dominant GPUs while boosting performance for its cloud services. The first chip could enter production as early as next year.

April 20, 2026
AI ChipsGoogleSemiconductors
AI Breakthrough: New Architecture Supercharges Language Models Across Data Centers
News

AI Breakthrough: New Architecture Supercharges Language Models Across Data Centers

Moonshot AI and Tsinghua University researchers have developed a clever solution to a growing problem in AI infrastructure. Their Pre-filling as a Service (PrfaaS) architecture tackles the computational bottlenecks plaguing large language models by splitting the workload across specialized data centers. Early tests show impressive results - think 54% faster processing and significantly reduced latency. This innovation couldn't come at a better time as AI systems increasingly strain against current technological limits.

April 20, 2026
AI InfrastructureMoonshot AILarge Language Models
Alibaba's New AI Model Packs Big Programming Smarts in Smaller Package
News

Alibaba's New AI Model Packs Big Programming Smarts in Smaller Package

Alibaba has unveiled Qwen3.6-35B-A3B, an open-source AI model that punches above its weight in programming tasks. Despite only activating 3 billion parameters at a time, this 'mixture of experts' model outperforms larger rivals while using less computing power. It shines in coding assistance, spatial reasoning, and visual understanding - already matching some premium AI services. Developers can now tap into this efficient brainpower through Alibaba's cloud platform.

April 17, 2026
AI programmingMixture of ExpertsAlibaba Cloud