Tencent's AI Painting Breakthrough Boosts Image Quality 300%
Tencent's AI Painting Breakthrough Delivers 300% Quality Improvement
Tencent has developed groundbreaking fine-tuning techniques that significantly enhance the quality of AI-generated images, achieving 300% improvements in human evaluation scores. The new methods address persistent challenges in diffusion models while enabling unprecedented control over output aesthetics.
The Challenge with Current Models
While existing diffusion models can optimize images through reward mechanisms, they face two critical limitations:
- Reward cheating: Models generate low-quality images that technically achieve high scores
- Inflexible adjustment: Offline reward models prevent real-time optimization

Tencent's Innovative Solutions
The research team introduced two novel approaches:
Direct-Align Technology
This method allows the model to recover original images from any point in the generation process by pre-injecting noise. Key benefits include:
- Reduces gradient explosion during backpropagation
- Enables optimization throughout the entire diffusion process (not just final steps)
- Improves training stability
Semantic Relative Preference Optimization (SRPO)
SRPO transforms reward signals into text-controlled parameters, allowing:
- Style adjustments through simple prompt modifications (e.g., adding "bright" or "dark" prefixes)
- No requirement for additional training data
- Real-time customization of output characteristics
Performance Results
The FLUX.1-dev model trained with SRPO demonstrated remarkable improvements:
- Realism excellent rate increased from 8.2% to 38.9%
- Aesthetic quality excellent rate rose from 9.8% to 40.5%
- Achieved natural textures while maintaining high visual appeal
The technology achieves these results with efficient training - converging in just 10 minutes using 32 H20 GPUs.
Future Implications
This advancement represents a significant leap forward for:
- Professional digital art creation tools
- Marketing and advertising content generation
- Game asset development pipelines
The research paper is available at: https://arxiv.org/pdf/2509.06942
Key Points:
- Tencent's new methods improve AI image quality by 300%
- Direct-Align enables full-process optimization
- SRPO allows text-based style control without extra data
- Significant improvements in realism and aesthetics demonstrated
- Technology converges rapidly with efficient GPU usage




