Skip to main content

HKU and Meituan Boost AI Math Skills with CodePlot-CoT

HKU and Meituan Breakthrough: AI Solves Math Problems Through Code Visualization

Large language models have historically struggled with mathematical geometry problems, despite excelling in text-based tasks. A new collaborative study from the University of Hong Kong (HKU) and Meituan presents CodePlot-CoT, an innovative solution that bridges this gap through code-driven visual reasoning.

The Core Challenge

Traditional AI models like GPT-4.1 and Gemini-2.5-Pro falter when faced with problems requiring geometric visualization or function graphing. While proficient in textual reasoning chains, these models lack the precision needed for mathematical graphics where angles, ratios, and positions must adhere to strict geometric constraints.

Image

The CodePlot-CoT Solution

The research team developed a paradigm shift:

  1. Code Generation: Instead of attempting direct image creation, the model writes executable plotting code (e.g., Python's Matplotlib)
  2. Precise Rendering: The code executes in a Python environment to generate accurate diagrams
  3. Integrated Reasoning: The model incorporates these code-generated visuals back into its problem-solving chain

This approach leverages AI's existing programming strengths while avoiding unreliable pixel-level image generation.

Key Technical Components

The project introduced two critical innovations:

  1. Math-VR Dataset: A comprehensive collection of 178,000 bilingual math problems (81% geometry-focused) requiring active drawing alongside reasoning
  2. MatplotCode Converter: A specialized tool converting mathematical figures into precise plotting code, outperforming commercial models in fidelity tests

Image

Performance Breakthroughs

The results demonstrate significant improvements:

  • 21% performance boost on Math-VR benchmark compared to base models
  • Even advanced closed-source models like Gemini-2.5-Pro still fail on one-third of test problems without this approach The findings suggest that scaling model size alone cannot solve visual math reasoning - precise code-driven methods are essential.

Implications for AI Development

The success of CodePlot-CoT suggests:

  • Future multimodal systems should prioritize programmatic precision over human-like visualization
  • Applications extend beyond mathematics to engineering design and scientific computing where accuracy is paramount The team has open-sourced all datasets, code, and pre-trained models to accelerate further research.

Key Points:

  • Traditional AI struggles with geometric precision in math problems
  • CodePlot-CoT replaces unreliable image generation with executable plotting code
  • New Math-VR dataset requires active drawing alongside problem-solving
  • Method delivers 21% performance improvement over conventional approaches
  • Open-source release enables broader adoption across AI community

Enjoyed this article?

Subscribe to our newsletter for the latest AI news, product reviews, and project recommendations delivered to your inbox weekly.

Weekly digestFree foreverUnsubscribe anytime

Related Articles

News

AI Luminary Peng Tianyu Takes Helm at Tencent Hunyuan's Multimodal Research

Peng Tianyu, a rising star in AI research with deep roots at Tsinghua University, has joined Tencent's Hunyuan division as Chief Research Scientist. The machine learning expert will spearhead advancements in multimodal reinforcement learning, blending visual and language AI capabilities. With an impressive track record that includes prestigious awards and publications at top conferences, Peng's move signals Tencent's commitment to pushing boundaries in generative AI technologies.

January 30, 2026
AI ResearchTencent HunyuanMultimodal Learning
News

Google's Gemini 3 Takes AI Reasoning to New Scientific Heights

Google has unveiled Gemini 3 Deep Think, marking a significant leap in AI capabilities beyond everyday conversations. This specialized model tackles complex scientific problems with Olympiad-level reasoning skills, scoring impressively on mathematical and programming challenges. Available now for select researchers and Google AI Ultra subscribers, it promises to transform from benchmark champion to actual lab partner.

February 13, 2026
AI ResearchMachine LearningScientific Computing
News

Apple's Secret Sauce: How a Tuned Open-Source Model Outperformed GPT-5 in UI Design

Apple's research team has achieved a surprising breakthrough in AI-assisted UI development. By collaborating with 21 design experts to provide targeted feedback through sketches and code modifications, they've demonstrated that quality trumps quantity in AI training. Their fine-tuned Qwen3-Coder model, despite its smaller size, now outperforms GPT-5 in generating app interfaces - proving that expert human insight remains invaluable in the age of artificial intelligence.

February 6, 2026
AI ResearchUI DevelopmentMachine Learning
News

Tencent's AI Push Gains Momentum as Top Scientist Tianyu Peng Joins Hunyuan Team

Tencent has made another strategic hire in its AI talent race, bringing on Tianyu Peng as Chief Research Scientist for its Hunyuan multimodal team. The Tsinghua PhD and former Sea AI Lab researcher will focus on advancing reinforcement learning capabilities within Tencent's flagship AI model. This move signals Tencent's continued commitment to competing at the forefront of multimodal AI development.

February 3, 2026
TencentAI ResearchReinforcement Learning
Tsinghua AI Whiz Joins Tencent to Supercharge Multimodal Learning
News

Tsinghua AI Whiz Joins Tencent to Supercharge Multimodal Learning

Tencent's AI ambitions get a major boost as Peng Tianyu, a rising star in machine learning from Tsinghua University, joins their Tongyi team. The 31-year-old prodigy brings expertise in reinforcement learning and multimodal systems, fresh from his stint at Sea AI Lab in Singapore. This marks another strategic hire for Tencent following their recent acquisition of an OpenAI researcher.

January 30, 2026
TencentAI ResearchMachine Learning
News

AI's Scientific Leap: Why 2026 Could Change Research Forever

OpenAI's Kevin Weil predicts AI will revolutionize scientific research by 2026, with GPT-5.2 already scoring higher than human experts in advanced knowledge tests. The focus shifts from AI as all-knowing oracle to humble research partner, helping scientists spot connections they might miss.

January 28, 2026
AI ResearchScientific BreakthroughsGPT-5