Skip to main content

Tencent's SRPO Tech Enhances AI Image Realism

Tencent's Breakthrough in AI Image Generation

Tencent's Hunyuan research team has developed Semantic Relative Preference Optimization (SRPO), a novel approach that significantly enhances the realism of AI-generated images. This technology specifically targets the unnatural "oily" appearance often seen in character skins produced by popular open-source models like Flux.

The Challenge of Realistic AI Images

As digital art gains popularity, the demand for high-quality AI-generated visuals has skyrocketed. However, current text-to-image models frequently produce character skins that appear unnaturally smooth and artificial - a phenomenon researchers describe as the "oily" effect.

Image

How SRPO Works

The breakthrough came through collaboration between Tencent, Chinese University of Hong Kong (Shenzhen), and Tsinghua University. SRPO introduces semantic preference concepts by:

  • Adjusting reward model objectives using control prompts (e.g., "realism")
  • Implementing positive/negative word guidance to balance reward bias
  • Employing Direct-Align strategy for better noise control

The team discovered that traditional methods focusing only on later generation stages caused overfitting. Their innovative solution injects controllable noise as reference points for reconstruction.

Image

Remarkable Efficiency Gains

SRPO demonstrates unprecedented training efficiency:

  • 3x improvement in realism/aesthetic scores
  • 75x faster than conventional methods (just 10 minutes)
  • Outperforms existing DanceGRPO approach

The technology's ability to optimize early generation stages prevents high-frequency information overfitting while maintaining precise reward signal transmission.

Image

Future Implications

This advancement promises to revolutionize digital art creation by:

  • Delivering more natural-looking character renders
  • Reducing post-processing requirements
  • Opening new creative possibilities for artists and developers

The research is publicly available on Tencent's project page.

Key Points:

  • SRPO addresses the "oily skin" problem in AI-generated images
  • Uses semantic preference optimization and Direct-Align strategy
  • Achieves major quality improvements with minimal training time
  • Potential to transform digital art and content creation workflows

Enjoyed this article?

Subscribe to our newsletter for the latest AI news, product reviews, and project recommendations delivered to your inbox weekly.

Weekly digestFree foreverUnsubscribe anytime

Related Articles

News

DeepSeek V4 Arrives: A Multimodal AI Powerhouse

DeepSeek is gearing up to launch its V4 model, a significant upgrade featuring image, video, and text generation capabilities. The new version promises better compatibility with domestic chips and introduces a 'lite' variant with a massive 1 million token context window. With potential parameter counts reaching into the trillions, this release could redefine what's possible in multimodal AI applications.

March 2, 2026
AI innovationmultimodal technologydeep learning
ByteDance's Seedream 5.0 Lite: Your New AI-Powered Visual Thinking Partner
News

ByteDance's Seedream 5.0 Lite: Your New AI-Powered Visual Thinking Partner

ByteDance has unveiled Seedream 5.0 Lite, an image creation model that thinks before it draws. Unlike previous versions that simply followed instructions, this AI now understands context, reasons visually, and taps into real-time data. Imagine an assistant that doesn't just create images but collaborates with you - whether you're designing infographics, editing photos, or visualizing complex concepts. The model's ability to grasp physical laws and specialized knowledge makes it particularly useful for professionals needing accurate technical illustrations.

February 13, 2026
AI image generationvisual reasoningByteDance
Alibaba's Qwen-Image-2.0 Merges Creation and Editing in Stunning 2K Detail
News

Alibaba's Qwen-Image-2.0 Merges Creation and Editing in Stunning 2K Detail

Alibaba Cloud has unveiled Qwen-Image-2.0, a groundbreaking AI model that combines image generation and editing into one seamless package. This lightweight 7B architecture delivers breathtaking 2K resolution images with pixel-perfect text rendering and realistic textures. From ancient calligraphy to modern infographics, it handles diverse creative tasks while maintaining character consistency across complex scenes. The model is now available for testing through Alibaba Cloud's BaiLian platform.

February 10, 2026
AI image generationAlibaba CloudComputer vision
News

AI Luminary Peng Tianyu Takes Helm at Tencent Hunyuan's Multimodal Research

Peng Tianyu, a rising star in AI research with deep roots at Tsinghua University, has joined Tencent's Hunyuan division as Chief Research Scientist. The machine learning expert will spearhead advancements in multimodal reinforcement learning, blending visual and language AI capabilities. With an impressive track record that includes prestigious awards and publications at top conferences, Peng's move signals Tencent's commitment to pushing boundaries in generative AI technologies.

January 30, 2026
AI ResearchTencent HunyuanMultimodal Learning
News

Hikvision's AI Inspector Tackles Factory Packaging Errors

Hikvision has unveiled a smart quality control system powered by its Guanlan AI model that spots packaging mistakes instantly. Unlike traditional manual checks, this solution scans every item with precision, adapting to complex production environments. Already proving valuable in automotive and electronics plants, it marks another step toward smarter manufacturing.

January 30, 2026
industrial automationquality controlcomputer vision
Small AI Model Packs Big Punch: Step3-VL-10B Challenges Giants
News

Small AI Model Packs Big Punch: Step3-VL-10B Challenges Giants

StepZen's new open-source vision-language model Step3-VL-10B is turning heads in AI circles. Despite its compact 10 billion parameters, it's outperforming models twenty times its size in visual reasoning and math competitions. The secret? Innovative training techniques that could revolutionize how we deploy AI on everyday devices.

January 20, 2026
AI innovationcomputer visionedge computing