
Baidu Unveils PaddleOCR-VL, Setting New OCR Benchmark

Baidu's PaddleOCR-VL Redefines Document Processing Standards

Baidu has officially released PaddleOCR-VL, a multimodal document parsing model for optical character recognition (OCR). The open-source model achieved a world-leading score of 92.6 on the OmniDocBench V1.5 benchmark, which evaluates four key areas: text recognition, table extraction, formula interpretation, and reading order prediction.

Technical Breakthroughs

The 0.9B-parameter model pairs a compact architecture with high throughput:

  • Integrates a NaViT-style dynamic-resolution visual encoder with the ERNIE-4.5-0.3B language model
  • Processes 1,881 tokens per second on a single A100 GPU (253% faster than dots.ocr)
  • Supports 109 languages, including complex scripts such as Arabic and Chinese
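
To put the throughput figure in context, the short sketch below works out what "253% faster" implies for the comparison model. It assumes the phrase means a speedup factor of 1 + 2.53 = 3.53, which the article does not state explicitly; the function names and the one-million-token workload are illustrative only.

```python
def implied_baseline(throughput: float, percent_faster: float) -> float:
    """Baseline throughput implied by 'X% faster', read as a (1 + X/100)x speedup."""
    return throughput / (1.0 + percent_faster / 100.0)


def seconds_for(tokens: int, throughput: float) -> float:
    """Time in seconds to emit `tokens` tokens at `throughput` tokens per second."""
    return tokens / throughput


PADDLE_TPS = 1881.0       # tokens/second on a single A100, as reported in the article
PERCENT_FASTER = 253.0    # reported margin over dots.ocr

# Assumption: "253% faster" is relative to the dots.ocr throughput.
baseline_tps = implied_baseline(PADDLE_TPS, PERCENT_FASTER)

# Hypothetical workload: one million output tokens.
paddle_seconds = seconds_for(1_000_000, PADDLE_TPS)
baseline_seconds = seconds_for(1_000_000, baseline_tps)

print(f"Implied dots.ocr throughput: {baseline_tps:.1f} tokens/s")
print(f"1M tokens: {paddle_seconds:.0f} s vs {baseline_seconds:.0f} s")
```

Under that reading, the comparison model would run at roughly 533 tokens per second. If the figure instead meant 2.53 times the baseline speed, the implied baseline would be higher.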


Performance Metrics

PaddleOCR-VL reports the following scores (lower is better for the edit distance and reading order error; higher is better for CDM and TEDS):

  • Text edit distance: 0.035
  • Formula recognition (CDM): 91.43
  • Table extraction (TEDS): 93.52
  • Reading order error: 0.043
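
For readers unfamiliar with the text metric, here is a minimal sketch of a normalized edit distance between a predicted string and a reference string. It assumes a Levenshtein distance divided by the length of the longer string; the benchmark's exact normalization and text preprocessing may differ, and the function names are illustrative.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions, or substitutions."""
    if len(a) < len(b):
        a, b = b, a  # keep the inner row as short as possible
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution or match
            ))
        previous = current
    return previous[-1]


def normalized_edit_distance(predicted: str, reference: str) -> float:
    """Edit distance scaled to [0, 1]; 0.0 means the strings are identical."""
    longest = max(len(predicted), len(reference))
    if longest == 0:
        return 0.0
    return levenshtein(predicted, reference) / longest


# One wrong character out of four gives a distance of 0.25.
print(normalized_edit_distance("abcd", "abce"))
```

On this scale, a score of 0.035 would correspond to roughly 3.5 character edits per 100 characters of the longer string.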

These results suggest the model is well suited to demanding applications such as historical archive digitization and handwritten manuscript processing.


Innovative Architecture

The model parses documents in two stages:

  1. Layout detection and reading order prediction
  2. Structured output of text, tables, and formulas

Because layout and reading order are resolved before content is recognized, the output for complex documents such as financial reports and academic papers preserves the logical flow of the original.
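
The two-stage flow can be sketched as a toy pipeline. This is not PaddleOCR-VL's API: the `Region` class, the function names, and the top-to-bottom, left-to-right sort are hypothetical stand-ins for the model's learned layout detection and recognition.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Region:
    kind: str     # "text", "table", or "formula"
    x: int        # left edge of the region on the page
    y: int        # top edge of the region on the page
    content: str  # placeholder for what recognition would produce


def stage_one(regions: List[Region]) -> List[Region]:
    """Stage 1 stand-in: order detected regions top-to-bottom, then left-to-right."""
    return sorted(regions, key=lambda r: (r.y, r.x))


def stage_two(ordered: List[Region]) -> List[Dict[str, str]]:
    """Stage 2 stand-in: emit one structured record per region, in reading order."""
    return [{"type": r.kind, "content": r.content} for r in ordered]


def parse_document(regions: List[Region]) -> List[Dict[str, str]]:
    return stage_two(stage_one(regions))


# Hypothetical page with regions listed out of order.
page = [
    Region("table", 0, 200, "Q1 | Q2"),
    Region("text", 0, 0, "Annual Report"),
    Region("formula", 0, 100, "E = mc^2"),
]
result = parse_document(page)
for record in result:
    print(record)
```

Separating ordering from recognition means each region can be recognized independently once its position in the reading sequence is fixed.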


Practical Applications

The technology addresses critical needs across sectors:

  • Government document management systems
  • Enterprise knowledge retrieval platforms
  • Academic research information extraction
  • Historical archive preservation projects

The lightweight design makes it particularly suitable for deployment in resource-constrained environments.

Key Points:

  • 🏆 World-leading performance on OmniDocBench V1.5 (92.6 score)
  • ⚡ Processes 1,881 tokens per second on a single A100 GPU
  • 🌍 Supports 109 languages including complex scripts
  • 🧠 Human-like understanding of document layouts
  • 🔓 Open-source availability promotes widespread adoption
