Skip to main content

Ant Group's BaiLing Team Open Sources Efficient AI Model

Ant Group's BaiLing Team Releases Revolutionary AI Model

Amid fierce competition in AI development, Ant Group's BaiLing large model team has open-sourced Ring-flash-linear-2.0-128K, a groundbreaking model designed specifically for ultra-long text programming applications. This release marks a significant advancement in efficient AI inference and long-context processing.

Image

Hybrid Architecture Delivers Unprecedented Efficiency

The model features an innovative hybrid linear + standard attention mechanism combined with a sparse MoE (Mixture of Experts) architecture. With total parameters scaled at 104B but only 6.1B activated during operation (4.8B excluding embeddings), the system achieves:

  • Near-linear time complexity
  • Constant space complexity
  • Generation speeds exceeding 200 tokens/second at 128K context on H20 hardware
  • Three times faster daily use speeds compared to traditional models

The architecture is particularly optimized for resource-limited scenarios while maintaining performance comparable to 40B dense models.

Enhanced Training Yields Superior Reasoning Capabilities

Building upon the Ling-flash-base-2.0 foundation, the model underwent:

  • Additional training on 1T tokens of high-quality data
  • Stable supervised fine-tuning (SFT)
  • Multi-stage reinforcement learning (RL)

The training process overcame traditional instability issues in MoE long-chain reasoning through Ant's proprietary "Icepop" algorithm. Benchmark results demonstrate exceptional capabilities:

  • 86.98 score in AIME2025 math competition
  • 90.23 Elo rating in CodeForces programming tests
  • Outperforms 40B dense models like Qwen3-32B in logical reasoning and creative writing tasks

Image

Long Context Handling Redefines Programming Efficiency

The model natively supports 128K context windows, expandable to 512K using YaRN extrapolation technology. Performance highlights include:

  • Prefill phase throughput nearly 5× higher than Qwen3-32B
  • Decoding phase achieving 10× acceleration
  • Maintains high accuracy even in 32K+ context programming tasks without "model leakage" issues The system proves particularly effective for:

  • Front-end development
  • Structured code generation
  • Agent simulation scenarios

    Open Source Availability Accelerates Adoption

    The BaiLing team has made the model available on:

    /div>">Hugging Face ">ModelScope ">">">">">">">">">">">Support includes BF16/FP8 formats and easy integration with popular frameworks like Transformers, SGLang, and vLLM."";"""Technical documentation is available on arXiv (https://arxiv.org/abs/2510.19338).""",,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,""",,,,"",,"",,"",,"",,"",,"",,"",,"",,"",,"".'''''''''''''''',,,,,,,,,,,,''''',,,,,,,,''''',,,,,,,,''''',,,,,,,,''''',,,,,,,,''''',,,,,,,,''''',,,,,,,,''''',,,,,,,,''''',,,,,,,,''''',,,,,,,,''''',,,,,,,,''''',,,,,,,,''''','', '''Key Points:'''''- Combines hybrid linear attention with MoE architecture'- Achieves SOTA performance with only 6.1B activated parameters'- Native 128K context support expandable to 512K'- Sevenfold efficiency improvement over previous versions'- Available now on Hugging Face and ModelScope

Enjoyed this article?

Subscribe to our newsletter for the latest AI news, product reviews, and project recommendations delivered to your inbox weekly.

Weekly digestFree foreverUnsubscribe anytime

Related Articles

Tencent Boosts AI Team with Tsinghua Star Scientist Peng Tianyu
News

Tencent Boosts AI Team with Tsinghua Star Scientist Peng Tianyu

Tencent's AI ambitions get another boost as machine learning expert Peng Tianyu joins their Tongyi Large Model team. The Tsinghua PhD, known for his work on robust machine learning, will lead multi-modal reinforcement learning research. This marks Tencent's latest high-profile hire following former OpenAI researcher Yao Shunyu's recent appointment.

January 30, 2026
TencentArtificial IntelligenceMachine Learning
News

SenseTime's New AI Detective Can Think and Act Like Humans

SenseTime has unveiled SenseNova-MARS, a groundbreaking AI model that mimics human reasoning and action-taking abilities. This open-source visual language model outperforms GPT-5.2 in several benchmarks, excelling at tasks requiring detailed image analysis, information retrieval, and complex reasoning. What sets it apart is its ability to autonomously plan and execute multi-step investigations - zooming in on tiny details, searching relevant information, and drawing logical conclusions just like a human detective would.

January 30, 2026
Artificial IntelligenceComputer VisionMachine Learning
News

OpenAI Retires GPT-4o as Users Embrace Newer AI Models

OpenAI is sunsetting several older AI models, including the once-popular GPT-4o, as users overwhelmingly shift to newer versions like GPT-5.2. The company cites significant improvements in personalization and creative thinking capabilities as reasons for phasing out these legacy models. Along with GPT-4o, several 'mini' and reasoning models will also be discontinued, marking a consolidation of OpenAI's offerings to focus on more advanced technology.

January 30, 2026
OpenAIGPT-4AI Development
Alibaba's Qwen AI Gets a Brain Boost With New Reasoning Model
News

Alibaba's Qwen AI Gets a Brain Boost With New Reasoning Model

Alibaba has rolled out its most advanced reasoning model yet - Qwen3-Max-Thinking - powering its Qwen AI assistant on PC and web platforms. This trillion-parameter model sets new benchmarks in factual knowledge, complex problem-solving, and human-like reasoning, rivaling top global AI systems. Users can now experience smarter, more proactive interactions with enhanced memory and logical capabilities.

January 27, 2026
Artificial IntelligenceAlibabaMachine Learning
vLLM Creators Launch Inferact With $800M Valuation
News

vLLM Creators Launch Inferact With $800M Valuation

The team behind vLLM, the popular open-source AI inference engine, has unveiled Inferact - a new venture aiming to revolutionize AI deployment efficiency. Backed by $150M in seed funding from top investors including Andreessen Horowitz and Sequoia Capital, Inferact seeks to slash inference costs while accelerating AI adoption across industries.

January 23, 2026
AI InfrastructureMachine LearningTech Startups
Baidu's ERNIE Bot 5.0 Breaks New Ground with Brain-Like AI Capabilities
News

Baidu's ERNIE Bot 5.0 Breaks New Ground with Brain-Like AI Capabilities

Baidu has unveiled its revolutionary ERNIE Bot 5.0, featuring native full-modal technology that mimics human cognition. Unlike competitors' patchwork approaches, this 2.4 trillion-parameter model processes text, images, video and audio simultaneously - enabling remarkable feats like generating working code from app tutorials and crafting literature in classical styles. The breakthrough could redefine how we interact with artificial intelligence.

January 22, 2026
Artificial IntelligenceMachine LearningNatural Language Processing