Skip to main content

Baidu Upgrades AI Computing Platform to Version 5.0

Baidu Intelligent Cloud Unveils Bage AI Computing Platform 5.0

At the 2025 Baidu Cloud Intelligence Conference, Shen Dou, Executive Vice President of Baidu Group and President of Baidu Intelligent Cloud Business Group, announced the official upgrade of the Bage AI Computing Platform to version 5.0. This release focuses on breaking efficiency bottlenecks in AI computing through enhancements in four key areas: networking, computing power, inference systems, and training-inference integration.

Networking Improvements

The fifth version of the Bage platform achieves faster communication speeds and lower latency, significantly boosting the efficiency of model training and inference. These improvements are critical for large-scale AI applications requiring real-time data processing.

Enhanced Computing Power

Following the release of the Kunlun Chip Super Node at the Create 2025 Baidu AI Developer Conference, the upgraded Bage platform now integrates this technology into Baidu Intelligent Cloud's public cloud service. This provides users with supercomputing capabilities, enabling them to run trillion-parameter models with just a few minutes and a single cloud instance.

Inference System Upgrades

The new version introduces three core strategies—decoupling, adaptive, and intelligent scheduling—to improve throughput and reduce latency. These advancements ensure smoother performance for complex AI workloads.

Training-Inference Integration

Baidu also unveiled the Bage Reinforcement Learning Framework, designed to maximize computing resource utilization. This framework enhances both training and inference efficiency, making it easier for developers to deploy large-scale AI models.

Key Points:

  • Faster networking: Reduced latency and improved communication speeds.
  • Kunlun Chip Super Node: Enables trillion-parameter model execution.
  • Inference optimizations: Decoupling, adaptive strategies, and intelligent scheduling.
  • Reinforcement Learning Framework: Boosts training-inference integration efficiency.

Enjoyed this article?

Subscribe to our newsletter for the latest AI news, product reviews, and project recommendations delivered to your inbox weekly.

Weekly digestFree foreverUnsubscribe anytime

Related Articles

Tencent Boosts AI Team with Tsinghua Star Scientist Peng Tianyu
News

Tencent Boosts AI Team with Tsinghua Star Scientist Peng Tianyu

Tencent strengthens its AI research capabilities with the addition of Dr. Peng Tianyu, a rising star in machine learning from Tsinghua University. The young scholar will lead multi-modal reinforcement learning efforts for Tencent's Tongyi Large Model team, bringing his expertise in trustworthy AI and generative models. This hire follows Tencent's recent strategic moves to attract top AI talent as it competes in the global AI race.

January 30, 2026
TencentAI TalentMachine Learning
News

Baidu Bets Big: Doubles AI Cloud Growth Target Amid Market Boom

Baidu Intelligent Cloud has made a bold move, doubling its AI revenue growth target for 2026 from 100% to 200%. This aggressive stance comes as the company leverages its leading position in China's cloud bidding market and prepares for what analysts predict will be a $400 billion global AI cloud industry by 2030. With proven commercialization success and plans for increased R&D investments, Baidu aims to transform from market follower to industry leader.

January 28, 2026
Baidu Intelligent CloudAI Market TrendsCloud Computing
Alibaba's Qwen AI Gets a Brain Boost With New Reasoning Model
News

Alibaba's Qwen AI Gets a Brain Boost With New Reasoning Model

Alibaba has rolled out its most advanced reasoning model yet - Qwen3-Max-Thinking - powering its Qwen AI assistant on PC and web platforms. This trillion-parameter model sets new benchmarks in factual knowledge, complex problem-solving, and human-like reasoning, rivaling top global AI systems. Users can now experience smarter, more proactive interactions with enhanced memory and logical capabilities.

January 27, 2026
Artificial IntelligenceAlibabaMachine Learning
vLLM Creators Launch Inferact With $800M Valuation
News

vLLM Creators Launch Inferact With $800M Valuation

The team behind vLLM, the popular open-source AI inference engine, has unveiled Inferact - a new venture aiming to revolutionize AI deployment efficiency. Backed by $150M in seed funding from top investors including Andreessen Horowitz and Sequoia Capital, Inferact seeks to slash inference costs while accelerating AI adoption across industries.

January 23, 2026
AI InfrastructureMachine LearningTech Startups
Baidu's ERNIE Bot 5.0 Breaks New Ground with Brain-Like AI Capabilities
News

Baidu's ERNIE Bot 5.0 Breaks New Ground with Brain-Like AI Capabilities

Baidu has unveiled its revolutionary ERNIE Bot 5.0, featuring native full-modal technology that mimics human cognition. Unlike competitors' patchwork approaches, this 2.4 trillion-parameter model processes text, images, video and audio simultaneously - enabling remarkable feats like generating working code from app tutorials and crafting literature in classical styles. The breakthrough could redefine how we interact with artificial intelligence.

January 22, 2026
Artificial IntelligenceMachine LearningNatural Language Processing
Zhipu AI Rations Coding Plan Access Amid Computing Crunch
News

Zhipu AI Rations Coding Plan Access Amid Computing Crunch

Chinese AI firm Zhipu faces growing pains as its popular GLM-4.7 model strains computing resources. Starting January 23, daily sign-ups for its coding service will be slashed to just 20% of current capacity. The move aims to maintain quality for existing users during peak hours, though automatic renewal subscribers won't be affected.

January 21, 2026
AI ComputingZhipu AIResource Management