Skip to main content

Tencent Unveils Low-Cost AI Optimization Method

Tencent's Breakthrough in Cost-Efficient AI Optimization

Tencent AI Lab has developed Training-Free GRPO (Gradient-based Policy Optimization), a revolutionary approach to optimizing large language models without traditional parameter fine-tuning. This innovation significantly reduces computational costs while delivering comparable performance improvements.

How Training-Free GRPO Works

The technology converts experiential knowledge into token-level prior information, allowing models to improve without altering their core parameters. By maintaining an external experience knowledge base dynamically, the method enhances capabilities while preserving the main model's architecture.

Image

Performance Improvements

Tests on DeepSeek-V3.1-Terminus showed notable gains:

  • Mathematical reasoning: Accuracy increased from 80% to 82.7% on AIME24 and from 67.9% to 73.3% on AIME25
  • Web search tasks: Pass@1 metric improved from 63.2% to 67.8%

The method achieved these results using just 100 cross-domain training samples, whereas traditional approaches typically require thousands.

Cost Comparison

The financial implications are staggering:

  • Traditional fine-tuning: ~70,000 RMB
  • Training-Free GRPO: ~120 RMB

The savings come primarily from avoiding computationally intensive operations like gradient backpropagation and parameter updates.

Image

Implications for AI Development

This breakthrough could democratize access to advanced AI optimization:

  • Enables smaller organizations with limited resources to enhance model performance
  • Maintains model generalization across domains
  • Opens new possibilities for efficient continuous learning systems

The research team acknowledges that further testing is needed across broader task categories beyond mathematical reasoning and information retrieval.

Paper Reference: Training-Free GRPO on arXiv

Key Points:

  • Achieves similar results as traditional fine-tuning at <0.2% of the cost
  • Works by updating external knowledge bases rather than model parameters
  • Demonstrated effectiveness in mathematical and search tasks
  • Particularly valuable for resource-constrained organizations

Enjoyed this article?

Subscribe to our newsletter for the latest AI news, product reviews, and project recommendations delivered to your inbox weekly.

Weekly digestFree foreverUnsubscribe anytime

Related Articles

Tencent's New Translation Tech Fits in Your Pocket
News

Tencent's New Translation Tech Fits in Your Pocket

Tencent has unveiled HY-MT1.5, a breakthrough translation system that brings powerful AI capabilities to mobile devices. The lightweight 1.8B version delivers near-instant translations while using minimal memory, perfect for smartphones. Meanwhile, the more robust 7B model excels at complex translations for enterprise use. What makes these models special? They combine massive training with human feedback to handle everything from technical jargon to cultural nuances - all while preserving document formatting.

January 5, 2026
machine translationAI modelsmobile technology
Chinese Researchers Teach AI to Spot Its Own Mistakes in Image Creation
News

Chinese Researchers Teach AI to Spot Its Own Mistakes in Image Creation

A breakthrough from Chinese universities tackles AI's 'visual dyslexia' - where image systems understand concepts but struggle to correctly portray them. Their UniCorn framework acts like an internal quality control team, catching and fixing errors mid-creation. Early tests show promising improvements in spatial accuracy and detail handling.

January 12, 2026
AI innovationcomputer visionmachine learning
News

Tencent's 'Upset Frog' Lets Gen Z Play Storyteller with AI

Tencent is testing an innovative mini-program called 'Upset Frog' that blends AI storytelling with user interaction. Unlike passive content platforms, it lets young users shape narratives through choices and commands, creating a social space around collaborative storytelling. While still in testing, this experiment could redefine digital entertainment for the TikTok generation.

January 9, 2026
GenerativeAIInteractiveMediaTencent
Fine-Tuning AI Models Without the Coding Headache
News

Fine-Tuning AI Models Without the Coding Headache

As AI models become ubiquitous, businesses face a challenge: generic models often miss the mark for specialized needs. Traditional fine-tuning requires coding expertise and expensive resources, but LLaMA-Factory Online changes the game. This visual platform lets anyone customize models through a simple interface, cutting costs and technical barriers. One team built a smart home assistant in just 10 hours - proving specialized AI doesn't have to be complicated or costly.

January 6, 2026
AI customizationno-code AImachine learning
Falcon H1R7B: The Compact AI Model Outperforming Larger Rivals
News

Falcon H1R7B: The Compact AI Model Outperforming Larger Rivals

The Abu Dhabi Innovation Institute has unveiled Falcon H1R7B, a surprisingly powerful 7-billion-parameter open-source language model that's rewriting the rules of AI performance. By combining innovative training techniques with hybrid architecture, this nimble contender delivers reasoning capabilities that rival models twice its size. Available now on Hugging Face, it could be a game-changer for developers needing efficient AI solutions.

January 6, 2026
AI innovationlanguage modelsmachine learning
Tencent's New AI Tool Turns Your Notes Into Polished Presentations
News

Tencent's New AI Tool Turns Your Notes Into Polished Presentations

Tencent's AI Workbench has introduced a game-changing PPT generator that taps into your personal knowledge base. Unlike generic tools, ima.copilot crafts slides tailored to your materials and logic. This innovation promises to streamline office work while maintaining creative authenticity - no more cookie-cutter presentations.

January 5, 2026
AI ProductivityTencentOffice Tech