Tencent's SRPO Tech Enhances AI Image RealismWelcome to AI DAMN! Discover the most amazing latest AI news, innovative AI products, and groundbreaking AI projects. From ChatGPT to cutting-edge models, we curate the AI developments that make you go 'DAMN!' - your daily dose of mind-blowing artificial intelligence.

Discover

Language

Account

Tencent's SRPO Tech Enhances AI Image Realism

Tencent's Breakthrough in AI Image Generation

Tencent's Hunyuan research team has developed Semantic Relative Preference Optimization (SRPO), a novel approach that significantly enhances the realism of AI-generated images. This technology specifically targets the unnatural "oily" appearance often seen in character skins produced by popular open-source models like Flux.

The Challenge of Realistic AI Images

As digital art gains popularity, the demand for high-quality AI-generated visuals has skyrocketed. However, current text-to-image models frequently produce character skins that appear unnaturally smooth and artificial - a phenomenon researchers describe as the "oily" effect.

How SRPO Works

The breakthrough came through collaboration between Tencent, Chinese University of Hong Kong (Shenzhen), and Tsinghua University. SRPO introduces semantic preference concepts by:

Adjusting reward model objectives using control prompts (e.g., "realism")
Implementing positive/negative word guidance to balance reward bias
Employing Direct-Align strategy for better noise control

The team discovered that traditional methods focusing only on later generation stages caused overfitting. Their innovative solution injects controllable noise as reference points for reconstruction.

Remarkable Efficiency Gains

SRPO demonstrates unprecedented training efficiency:

3x improvement in realism/aesthetic scores
75x faster than conventional methods (just 10 minutes)
Outperforms existing DanceGRPO approach

The technology's ability to optimize early generation stages prevents high-frequency information overfitting while maintaining precise reward signal transmission.

Future Implications

This advancement promises to revolutionize digital art creation by:

Delivering more natural-looking character renders
Reducing post-processing requirements
Opening new creative possibilities for artists and developers

The research is publicly available on Tencent's project page.

Key Points:

SRPO addresses the "oily skin" problem in AI-generated images
Uses semantic preference optimization and Direct-Align strategy
Achieves major quality improvements with minimal training time
Potential to transform digital art and content creation workflows

Enjoyed this article?

Subscribe to our newsletter for the latest AI news, product reviews, and project recommendations delivered to your inbox weekly.

Weekly digestFree foreverUnsubscribe anytime

News

DeepSeek V4 Arrives: A Multimodal AI Powerhouse

DeepSeek is gearing up to launch its V4 model, a significant upgrade featuring image, video, and text generation capabilities. The new version promises better compatibility with domestic chips and introduces a 'lite' variant with a massive 1 million token context window. With potential parameter counts reaching into the trillions, this release could redefine what's possible in multimodal AI applications.

March 2, 2026

AI innovationmultimodal technologydeep learning

News

ByteDance's Seedream 5.0 Lite: Your New AI-Powered Visual Thinking Partner

ByteDance has unveiled Seedream 5.0 Lite, an image creation model that thinks before it draws. Unlike previous versions that simply followed instructions, this AI now understands context, reasons visually, and taps into real-time data. Imagine an assistant that doesn't just create images but collaborates with you - whether you're designing infographics, editing photos, or visualizing complex concepts. The model's ability to grasp physical laws and specialized knowledge makes it particularly useful for professionals needing accurate technical illustrations.

February 13, 2026

AI image generationvisual reasoningByteDance

News

Alibaba's Qwen-Image-2.0 Merges Creation and Editing in Stunning 2K Detail

Alibaba Cloud has unveiled Qwen-Image-2.0, a groundbreaking AI model that combines image generation and editing into one seamless package. This lightweight 7B architecture delivers breathtaking 2K resolution images with pixel-perfect text rendering and realistic textures. From ancient calligraphy to modern infographics, it handles diverse creative tasks while maintaining character consistency across complex scenes. The model is now available for testing through Alibaba Cloud's BaiLian platform.

February 10, 2026

AI image generationAlibaba CloudComputer vision

News

AI Luminary Peng Tianyu Takes Helm at Tencent Hunyuan's Multimodal Research

Peng Tianyu, a rising star in AI research with deep roots at Tsinghua University, has joined Tencent's Hunyuan division as Chief Research Scientist. The machine learning expert will spearhead advancements in multimodal reinforcement learning, blending visual and language AI capabilities. With an impressive track record that includes prestigious awards and publications at top conferences, Peng's move signals Tencent's commitment to pushing boundaries in generative AI technologies.

January 30, 2026

AI ResearchTencent HunyuanMultimodal Learning

News

Hikvision's AI Inspector Tackles Factory Packaging Errors

Hikvision has unveiled a smart quality control system powered by its Guanlan AI model that spots packaging mistakes instantly. Unlike traditional manual checks, this solution scans every item with precision, adapting to complex production environments. Already proving valuable in automotive and electronics plants, it marks another step toward smarter manufacturing.

January 30, 2026

industrial automationquality controlcomputer vision

News

Small AI Model Packs Big Punch: Step3-VL-10B Challenges Giants

StepZen's new open-source vision-language model Step3-VL-10B is turning heads in AI circles. Despite its compact 10 billion parameters, it's outperforming models twenty times its size in visual reasoning and math competitions. The secret? Innovative training techniques that could revolutionize how we deploy AI on everyday devices.

January 20, 2026

AI innovationcomputer visionedge computing

Tencent's SRPO Tech Enhances AI Image Realism

Tencent's Breakthrough in AI Image Generation

The Challenge of Realistic AI Images

How SRPO Works

Remarkable Efficiency Gains

Future Implications

Key Points:

Enjoyed this article?

Related Articles

DeepSeek V4 Arrives: A Multimodal AI Powerhouse

ByteDance's Seedream 5.0 Lite: Your New AI-Powered Visual Thinking Partner

Alibaba's Qwen-Image-2.0 Merges Creation and Editing in Stunning 2K Detail

AI Luminary Peng Tianyu Takes Helm at Tencent Hunyuan's Multimodal Research

Hikvision's AI Inspector Tackles Factory Packaging Errors

Small AI Model Packs Big Punch: Step3-VL-10B Challenges Giants

Popular Articles

TSMC Reports Record Revenue, AI Growth Fuels Optimism for 2025

Anthropic Enhances Claude AI for Financial Analysts

Breakthrough in Robot Vision: AI Now Understands 3D Space Better

South Korea's Zeta AI Chat Outpaces ChatGPT in User Engagement

Demand for Human Customer Service Grows Amid AI Limitations

Main Pages

Content

Others