Skip to main content

Alibaba's New AI Algorithm Pushes Reasoning Limits Beyond OpenAI's Mini Model

Alibaba's AI Breakthrough: Thinking Deeper Than Ever Before

In a significant advancement for artificial intelligence, Alibaba's Tongyi Lab has developed FIPO - an algorithm that fundamentally changes how AI models approach complex reasoning tasks. This innovation comes at a time when the industry is grappling with the limitations of current reinforcement learning approaches.

Solving the Thinking Bottleneck

The core challenge FIPO addresses is what researchers call "reasoning length stagnation." Traditional models often get stuck when tackling multi-step problems like advanced mathematics. They struggle to identify which pieces of information truly matter for reaching the correct solution.

FIPO introduces two clever solutions:

  • Future-KL Mechanism: This rewards tokens that prove valuable for future reasoning steps, essentially teaching the AI to plan ahead
  • Symbolic Log Probability Difference: A technical innovation that helps the model recognize when it's making real progress versus going in circles

The results speak for themselves - average reasoning length jumped to over 10,000 tokens in testing, smashing previous limitations.

Outperforming the Competition

In head-to-head comparisons, Alibaba's 32B model equipped with FIPO demonstrated remarkable capabilities:

  • Surpassed similar-sized models using traditional approaches
  • Outperformed OpenAI's o1-mini on select metrics
  • Showed particular strength in mathematical reasoning tasks

"What excites us most," explains a Tongyi researcher, "is seeing the model maintain coherence across exceptionally long reasoning chains. It's like watching a student work through a complex proof without losing track of their argument."

The Bigger Picture for AI Development

This breakthrough comes as part of Tongyi Lab's broader push to enhance AI fundamentals. Just last month, they released CoPaw 1.0, another innovation focused on improving model interactions. Together, these developments suggest Chinese tech firms are making serious strides in core AI capabilities.

The implications extend beyond academic benchmarks. More capable reasoning could transform fields from scientific research to financial analysis where complex, multi-step problem solving is essential.

Key Points:

  • FIPO algorithm enables dramatically longer and more accurate reasoning chains
  • Outperforms comparable models including OpenAI's o1-mini
  • Particularly strong at mathematical and logical problems
  • Part of Alibaba's growing portfolio of fundamental AI innovations

Enjoyed this article?

Subscribe to our newsletter for the latest AI news, product reviews, and project recommendations delivered to your inbox weekly.

Weekly digestFree foreverUnsubscribe anytime

Related Articles

Alibaba's New Algorithm Helps AI Think More Like Humans
News

Alibaba's New Algorithm Helps AI Think More Like Humans

Alibaba's Tongyi Lab has developed a breakthrough algorithm called FIPO that helps large language models identify and focus on the most important parts of complex reasoning tasks. Unlike traditional methods that struggle to distinguish critical information, FIPO uses a novel 'Future-KL' mechanism to reward tokens that significantly impact later reasoning steps. Early tests show impressive results, with models handling reasoning chains over 10,000 tokens long while improving accuracy in mathematical problem-solving.

April 7, 2026
AI ResearchMachine LearningAlibaba
News

Alibaba's AI Model Hits Trillion Token Milestone, Tops Global Rankings

Alibaba's Qwen 3.6 Plus has made history by becoming the first AI model to surpass 10 trillion tokens in daily usage on OpenRouter, securing the top spot in global rankings. This achievement signals China's growing influence in the AI landscape, with domestic models gaining traction through competitive pricing and rapid innovation. Meanwhile, the capital market shows strong interest in AI technologies, with trading volumes hitting 1 trillion yuan on Chinese exchanges.

April 7, 2026
Artificial IntelligenceAlibabaOpenRouter
News

DeepSeek V4 Emerges: A Glimpse Into China's Next-Gen AI Powerhouse

The tech world is abuzz as DeepSeek V4 enters intensive testing, revealing three distinct versions tailored for different needs. From lightning-fast responses to advanced visual analysis, this homegrown AI showcases China's push for technological independence. What makes this release particularly exciting is its deep integration with domestic chips, signaling a strategic move away from foreign dependencies. As the AI arms race heats up, could this be the model that redefines what Chinese-developed artificial intelligence can achieve?

April 8, 2026
AI DevelopmentChinese TechMachine Learning
Google's Gemma 4: Small AI Models Pack a Big Punch
News

Google's Gemma 4: Small AI Models Pack a Big Punch

Google has open-sourced its Gemma 4 AI models, and they're turning heads in the tech world. What makes them special? Some of these compact models outperform giants 20 times their size, bringing powerful AI capabilities to everyday devices like smartphones. With optimized versions for mobile and IoT devices, Gemma 4 could change how we interact with AI in our daily lives.

April 7, 2026
AIMachine LearningGoogle
News

Google's Gemma 4: A Powerhouse AI Model Set to Shake Up Open-Source Landscape

Google is gearing up to unveil Gemma 4, its next-generation open-source AI model that promises four times the parameters of its predecessor. With a rumored 120 billion parameters and innovative MoE architecture, this release marks Google's strategic move to reclaim influence in the open-source AI space. The tech world watches closely as this development could redefine the balance between commercial and open-source AI models.

April 2, 2026
AI DevelopmentOpen Source TechMachine Learning
News

Alibaba and Shanghai AI Lab Tackle AI Safety in New White Paper

As AI evolves from chatbots to autonomous agents, safety concerns take center stage. Alibaba and Shanghai Artificial Intelligence Laboratory have teamed up to release a groundbreaking white paper addressing these risks. The document outlines a three-pronged approach focusing on corporate responsibility, social benefit, and industry collaboration. This comes as China's tech sector shifts its focus from raw computing power to responsible AI development.

April 1, 2026
AI SafetyAlibabaShanghai AI Lab