Grab Develops AI Model for Southeast Asian Languages

Grab Tackles Language Recognition Challenges with Custom AI Model

Singapore-based super app company Grab has developed its own visual language model to address shortcomings in processing Southeast Asian languages, according to a recent technical blog post. The innovation comes as existing commercial solutions struggle with non-Latin scripts common across Grab's eight-country operational footprint.

The Compliance Challenge

Grab's platform, which offers ride-hailing, food delivery, and financial services across Singapore, Malaysia, Indonesia and neighboring countries, requires accurate document processing for customer verification. Traditional OCR systems proved inadequate when handling diverse identity documents written in regional scripts.

"We found commercial models made frequent errors with Southeast Asian languages," Grab engineers noted. "Even open-source visual language models lacked sufficient accuracy despite better efficiency."

Building a Specialized Solution

In 2025, Grab began developing its own visual large language model (VLLM), which encodes images into vector representations for text extraction. The team selected Alibaba Cloud's Qwen2-VL 2B as the foundation due to:

  • Moderate model size
  • Native Southeast Asian language support
  • Dynamic handling of varied image resolutions
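To make the resolution point concrete, here is a rough sketch of how the number of visual tokens can scale with image size in a model of this kind. It assumes 14-pixel patches and a 2x2 merge of neighbouring patches, as described for Qwen2-VL; the function name is hypothetical, and the model's resizing limits and special tokens are ignored, so treat the numbers as estimates rather than the model's exact behaviour.

```python
import math

PATCH = 14  # assumed vision-encoder patch size, in pixels
MERGE = 2   # assumed: 2x2 neighbouring patches are merged into one token


def visual_tokens(width: int, height: int) -> int:
    """Estimate visual tokens for an image (hypothetical helper)."""
    unit = PATCH * MERGE  # each token then covers a 28x28 pixel area
    # Simplification: round each side up to a whole number of units.
    cols = math.ceil(width / unit)
    rows = math.ceil(height / unit)
    return cols * rows


if __name__ == "__main__":
    for size in [(224, 224), (448, 224), (1024, 640)]:
        print(size, visual_tokens(*size))
```

Under these assumptions a small ID-card crop costs far fewer tokens than a full-page scan, which is one reason variable-resolution input suits mixed document types.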

The company prepared training data and adapted the model by:

  1. Extracting regional language content from Common Crawl
  2. Building synthetic data pipelines generating text under diverse fonts/backgrounds
  3. Applying low-rank adaptation fine-tuning techniques
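The synthetic data step can be pictured as pairing each line of text with several randomly chosen visual styles before rendering. The sketch below only samples those style settings and does not render images; the font and background names, the `RenderSpec` type, and the value ranges are all hypothetical and are not taken from Grab's pipeline.

```python
import random
from dataclasses import dataclass

# Hypothetical style pools; a real pipeline would point at actual
# font files and background images.
FONTS = ["serif", "sans", "mono", "handwritten"]
BACKGROUNDS = ["plain", "paper", "laminated", "noisy"]


@dataclass(frozen=True)
class RenderSpec:
    text: str            # ground-truth label the model should extract
    font: str
    font_size: int
    background: str
    rotation_deg: float


def make_specs(lines, per_line, seed=0):
    """Pair each text line with several randomly sampled styles."""
    rng = random.Random(seed)  # seeded so the dataset is reproducible
    specs = []
    for text in lines:
        for _ in range(per_line):
            specs.append(RenderSpec(
                text=text,
                font=rng.choice(FONTS),
                font_size=rng.randint(12, 48),
                background=rng.choice(BACKGROUNDS),
                rotation_deg=round(rng.uniform(-5.0, 5.0), 1),
            ))
    return specs


if __name__ == "__main__":
    for spec in make_specs(["Nama: Budi", "Alamat: Jakarta"], per_line=2):
        print(spec)
```

Because the text is known before rendering, every generated image comes with an exact label, which is what makes synthetic data attractive for OCR-style training.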

The resulting model performed particularly well on Indonesian documents, while work on Thai and Vietnamese recognition continues.
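For readers unfamiliar with low-rank adaptation (LoRA), the idea is to freeze the original weight matrix W and train only two small matrices B and A, using W + (alpha / r) * B·A in place of W. The sketch below illustrates that arithmetic in plain Python; the function names and the matrix sizes are illustrative assumptions, and it is not Grab's training code.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def lora_weight(w, b, a, alpha, r):
    """Return W + (alpha / r) * B @ A, the adapted weight matrix."""
    delta = matmul(b, a)  # low-rank update with the same shape as W
    scale = alpha / r
    return [[w_ij + scale * d_ij for w_ij, d_ij in zip(w_row, d_row)]
            for w_row, d_row in zip(w, delta)]


def trainable_params(d_out, d_in, r):
    """Parameters in B (d_out x r) plus A (r x d_in)."""
    return r * (d_out + d_in)


if __name__ == "__main__":
    w = [[1.0, 2.0], [3.0, 4.0]]   # frozen base weights
    b = [[1.0], [2.0]]             # trainable, shape 2 x 1 (rank r = 1)
    a = [[0.5, 0.25]]              # trainable, shape 1 x 2
    print(lora_weight(w, b, a, alpha=2, r=1))
    # Illustrative sizes only: a 1024 x 1024 layer adapted with rank 8.
    print(trainable_params(1024, 1024, 8), "vs", 1024 * 1024)
```

With B initialised to zeros the adapted matrix equals W, so training starts from the base model's behaviour and only the small matrices are updated.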

Performance Breakthroughs

The customized solution demonstrates several advantages:

  • Outperforms general OCR tools in accuracy
  • Exceeds commercial LLMs' regional language capabilities
  • Maintains lightweight efficiency through focused training
  • Enables reliable compliance document processing

"Strategic use of high-quality data proves small specialized models can achieve both effectiveness and efficiency," Grab stated.

The company plans further model development to expand its document processing capabilities amid growing operational complexity.

Key Points:

📊 Commercial models underperform on Southeast Asian scripts, prompting Grab's custom solution
🔍 Visual LLM breakthrough improves ID/license processing accuracy
🚀 Continued development planned to handle more document types and languages
