Baidu's PaddleOCR-VL Leads Global OCR Rankings
On October 16, Baidu's PaddlePaddle team unveiled PaddleOCR-VL, its latest vision-language model, which quickly drew wide attention in the optical character recognition (OCR) field. The model scored 92.56 on the OmniDocBench V1.5 benchmark, outperforming rivals such as DeepSeek-OCR and taking the top spot globally.

Hugging Face Trending List Dominance
As of October 21, OCR models led Hugging Face's Trending Models list:
- 🥇 PaddleOCR-VL (PaddlePaddle)
- 🥈 DeepSeek-OCR
- 🥉 NanonetOCR
PaddleOCR-VL maintained its lead for five consecutive days, cementing its status as the most talked-about open-source OCR model.
Advanced Capabilities
The model supports 109 languages and excels at parsing complex documents, including:
- Text
- Tables
- Formulas
- Charts
It also reconstructs a document's semantic structure, so it not only recognizes characters but also understands document context. This makes it well suited to tasks such as parsing research papers, processing invoices, and extracting knowledge.
Industry Collaboration
The DeepSeek team acknowledged PaddleOCR in its research paper, noting that it used PaddleOCR's annotations to build training data. This highlights a broader trend: leading AI institutions, including Baidu, DeepSeek, and Shanghai AI Lab, are open-sourcing OCR models to advance foundational capabilities for large-scale AI training.
The current "OCR arms race" isn’t just about accuracy; it’s about accelerating AI’s ability to interpret text and images worldwide.
Key Points:
- PaddleOCR-VL scored 92.56 in OmniDocBench V1.5.
- Led Hugging Face's trending list for five consecutive days.
- Supports 109 languages and complex document parsing.
- DeepSeek's use of PaddleOCR annotations for training data underscores its role in AI data annotation.


