Ant Group Unveils Multilingual AI Framework for Document Security

Ant Group's Breakthrough in Multilingual AI Security

At the recent Hong Kong FinTech Festival, Ant Financial Technology unveiled its revolutionary Multilingual Multimodal Large Model Training Framework, designed to overcome language barriers in AI applications. This innovation addresses critical challenges in global document verification and fraud detection.

Solving the Language Bottleneck

Traditional AI models primarily trained on English data often struggle with:

  • Language confusion in minority languages
  • Inconsistent reasoning across multilingual contexts
  • Poor performance in resource-scarce linguistic environments

The new framework achieved top rankings in the Multicultural Multilingual Visual Question Answering (CVQA) benchmark, particularly excelling in:

  • Egyptian Arabic
  • Javanese
  • Bahasa Indonesia
  • Sundanese

Image

Technical Innovations

The system's breakthrough comes from three core components:

  1. Target-language thinking mechanism: Processes information natively in each language
  2. Multi-dimensional reward strategies: Fine-tunes model performance across linguistic dimensions
  3. Automated data solutions: Compensates for scarce training data in minority languages

Comparative tests show the framework:

  • Improves accuracy by 9.5% over similar open-source models
  • Outperforms GPT-4o and Gemini-2.5-flash in specific tasks
  • Achieves highest overall score in multilingual VQA benchmarks

Enhanced Security Capabilities

The integrated security framework combines:

  • Visual analysis for detecting image tampering
  • Common sense reasoning to identify logical inconsistencies
  • Explainable AI that pinpoints manipulated areas with reasoning

These features significantly boost risk management for:

  • Insurance claims processing
  • Credit application reviews
  • Cross-border trade documentation

Global Implementation

The technology currently powers Ant's ZOLOZ RealDoc platform, supporting:

  • 119 languages for document authentication
  • Processing of complex business contracts and trade documents
  • Compliance with international financial regulations

The system has demonstrated particular effectiveness in Southeast Asian markets where multilingual documentation is common.

Key Points:

  • First multilingual framework to outperform major closed-source models
  • 9.5% accuracy improvement over comparable open-source alternatives
  • Supports 119 languages through innovative training architecture
  • Combines visual forgery detection with logical consistency checks
  • Currently deployed in Ant Group's global financial services

Related Articles