PP-OCRv6 on Hugging Face: 50-Language OCR from 1.5M to 34.5M Parameters
Summary
PP-OCRv6 is the latest generation of PaddleOCR's universal OCR model family, offering three tiers from 1.5M to 34.5M parameters, supporting 50 languages, and achieving significant accuracy improvements over previous versions.
View Cached Full Text
Cached at: 06/22/26, 01:31 PM
PP-OCRv6 on Hugging Face: 50-Language OCR from 1.5M to 34.5M Parameters
Source: https://huggingface.co/blog/PaddlePaddle/pp-ocrv6 Back to Articles
- What’s new in PP-OCRv6- Three model tiers - PPLCNetV4 backbone - RepLKFPN for text detection - EncoderWithLightSVTR for recognition - Unified multilingual OCR - Quick start with PaddleOCR - Available inference backends - Conclusion Evaluate PP-OCRv6 online, then integrate lightweight, production-ready OCR with PaddlePaddle, Transformers, or ONNX Runtime backend.
PP-OCRv6 is the latest generation of PaddleOCR’s universal OCR model family. It is designed for real-world text detection and recognition across documents, screenshots, multilingual images, digital displays, industrial labels, and scene text.
The model family scales from1.5M to 34.5M parameters, with three tiers:tiny,small, andmedium. The medium and small tiers support50 languages, including Simplified Chinese, Traditional Chinese, English, Japanese, and 46 Latin-script languages. Try PP-OCRv6 online quickly:PP-OCRv6 Online Demo.
On PaddleOCR’s official in-house multi-scenario OCR benchmarks,PP-OCRv6_mediumreaches86.2% detection Hmeanand83.2% recognition accuracy. Compared with PP-OCRv5_server, it improves text detection by**+4.6 percentage pointsand text recognition by+5.1 percentage points**.
PP-OCRv6 focuses on a practical OCR need: producing accurate, structured text outputs with small models and flexible deployment options. For a deeper discussion of why specialized OCR models remain useful in the VLM era, see our previous blog:PP-OCRv5 on Hugging Face: A Specialized Approach to OCR.
https://huggingface.co/blog/PaddlePaddle/pp-ocrv6#whats-new-in-pp-ocrv6What’s new in PP-OCRv6
PP-OCRv6 introduces architecture, training, and data improvements across detection and recognition. The main design goal is to improve OCR accuracy while keeping model sizes suitable for different deployment settings.
https://huggingface.co/blog/PaddlePaddle/pp-ocrv6#three-model-tiersThree model tiers
PP-OCRv6 provides three model tiers, covering different model sizes and OCR accuracy levels.
ModelModel sizeDetection HmeanRecognition accuracyTypical application scenariosPP-OCRv6_tiny****1.5M params80.6%73.5%Edge devices, lightweight local OCR, latency-sensitive demos, constrained environmentsPP-OCRv6_small****7.7M params84.1%81.3%Mobile, desktop, balanced OCR services, multilingual OCR with lower compute cost**PP-OCRv6_medium34.5M params86.2%****83.2%**Accuracy-oriented OCR, server-side pipelines, industrial OCR, document ingestion, multilingual OCR
https://huggingface.co/blog/PaddlePaddle/pp-ocrv6#pplcnetv4-backbonePPLCNetV4 backbone
PP-OCRv6 usesPPLCNetV4as a unified backbone for text detection and text recognition.
For developers, the main benefit is consistency across the model family. The tiny, small, and medium tiers are not unrelated models; they are part of the same OCR family and share a common architectural direction.
https://huggingface.co/blog/PaddlePaddle/pp-ocrv6#replkfpn-for-text-detectionRepLKFPN for text detection
Text detection is the first stage of the OCR pipeline. Detection quality affects the crops sent to the recognizer, and poor crops often lead to poorer recognition.
PP-OCRv6 upgrades the detection module withRepLKFPN, a lightweight large-kernel feature pyramid network designed for multi-scale text detection while keeping inference efficient.
This is relevant for real-world OCR inputs, where text may be small, dense, rotated, low-resolution, or embedded in complex backgrounds.
https://huggingface.co/blog/PaddlePaddle/pp-ocrv6#encoderwithlightsvtr-for-recognitionEncoderWithLightSVTR for recognition
For text recognition, PP-OCRv6 usesEncoderWithLightSVTR. It combines local context modeling with global attention to improve recognition quality on challenging text crops.
The recognition improvements are especially relevant for multilingual text, screen text, industrial characters, special symbols, dense text, and noisy image regions.
https://huggingface.co/blog/PaddlePaddle/pp-ocrv6#unified-multilingual-ocrUnified multilingual OCR
The medium and small tiers support50 languagesin one model family, covering Simplified Chinese, Traditional Chinese, English, Japanese, and 46 Latin-script languages.
This helps reduce the need for separate OCR models across common multilingual OCR scenarios.
https://huggingface.co/blog/PaddlePaddle/pp-ocrv6#quick-start-with-paddleocrQuick start with PaddleOCR
Install PaddleOCR:
pip install paddleocr
Run OCR with Paddle Infernece(Default backend):
from paddleocr import PaddleOCR
# Model: PP-OCRv6_medium(Default)
# Backend: Paddle Inference(Default)
ocr = PaddleOCR(
use_doc_orientation_classify=False,
use_doc_unwarping=False,
use_textline_orientation=False,
)
result = ocr.predict("https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/general_ocr_002.png")
for res in result:
res.print()
res.save_to_img("output")
res.save_to_json("output")
The OCR result can be saved as visualization images and structured JSON output. The structured output can then be used by downstream systems such as document parsing, search, extraction, RAG, analytics, or agent workflows.
https://huggingface.co/blog/PaddlePaddle/pp-ocrv6#available-inference-backendsAvailable inference backends
PP-OCRv6 can be used with multiple inference backends through PaddleOCR.PaddleOCR 3.7provides a unified inference-engine interface, whereengineselects the underlying runtime and related configuration can be passed through the pipeline or module API.
BackendDescriptionTransformersHugging Face / PyTorch-oriented inference path for supported PaddleOCR modelsONNX RuntimePortable inference path for ONNX-based deployment environmentsPaddle InferenceNative Paddle inference format For Hugging Face users, PaddleOCR supports running selected OCR and document parsing models with a Transformers backend. This can be enabled with:
engine="transformers"
For more details on how the Transformers backend works in PaddleOCR, see:
PaddleOCR: Running OCR and Document Parsing Tasks with a Transformers Backend
Run PP-OCRv6 example with Transformer Backend:
from paddleocr import PaddleOCR
# Model: PP-OCRv6_medium(Default)
# Backend: transformers
ocr = PaddleOCR(
use_doc_orientation_classify=False,
use_doc_unwarping=False,
use_textline_orientation=False,
engine="transformers",
)
result = ocr.predict("https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/general_ocr_002.png")
ONNX variants are also available in thePP-OCRv6 Collectionfor environments that use ONNX Runtime throughengine="onnxruntime":
from paddleocr import PaddleOCR
# Model: PP-OCRv6_medium(Default)
# Backend: ONNX Runtime
ocr = PaddleOCR(
use_doc_orientation_classify=False,
use_doc_unwarping=False,
use_textline_orientation=False,
engine="onnxruntime",
)
result = ocr.predict("https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/general_ocr_002.png")
Together, these backend options make PP-OCRv6 available across different runtime environments while keeping the same OCR model family on the Hugging Face Hub.
https://huggingface.co/blog/PaddlePaddle/pp-ocrv6#conclusionConclusion
PP-OCRv6 extends PaddleOCR with a lightweight, multilingual OCR model family for real-world text detection and recognition.
The release includes three model tiers from1.5M to 34.5M parameters, up to50-language OCR support, improved detection and recognition accuracy over PP-OCRv5_server, and multiple model formats on the Hugging Face Hub, includingsafetensors,Paddle inference models, andONNX models.
Together with the hosted Hugging Face Space and the available PaddleOCR inference backends, PP-OCRv6 provides several entry points for evaluation and integration:
- Online Demo:PP-OCRv6 Online Demo
- Model Collection:PP-OCRv6 Collection
- Transformers Backend Blog:PaddleOCR with Transformers Backend
- PaddleOCR Documentation:PP-OCRv6 Documentation
- PaddleOCR:PP-OCRv6 Documentation
- PaddleOCR Official Website:https://www.paddleocr.com
You can evaluate PP-OCRv6 with the online demo, explore the available model assets in the Collection, and use the inference backend that matches your own OCR workflow.
Similar Articles
🚀PP-OCRv6 is officially released !
PaddleOCR releases PP-OCRv6, a new OCR model series with sizes from 1.5M to 34.5M parameters, offering improved accuracy and faster inference, supporting 50 languages and new scenarios like PCB and CAD drawings, under Apache 2.0 open source license.
@AdinaYakup: PP-OCRv6 just released by Baidu @PaddlePaddle tiny 1.5M / small 7.7M / medium 34.5M 48+ languages Supports handwritten/…
Baidu's PaddlePaddle released PP-OCRv6, an OCR model supporting 48+ languages with tiny (1.5M), small (7.7M), and medium (34.5M) sizes, optimized for edge deployment and handwritten/printed/industrial/screen/card text.
@PaddlePaddle: PP-OCRv6 Tech Deep Dive Ep.1: In the Era of Large Models, Why Does Lightweight OCR Still Have Irreplaceable Value? — PP…
PP-OCRv6 is a lightweight OCR model (34.5M parameters) that challenges large VLMs with its MetaFormer architecture, offering efficient text detection and recognition across multiple deployment scenarios.
@TeksEdge: Need to OCR documents? PP-OCRv6 dropped — currently the best open-source OCR models you can download ◆︎ Fully Open Sour…
PP-OCRv6 is a new open-source OCR model series from Baidu's PaddleOCR, available in Tiny/Small/Medium sizes with excellent accuracy and speed, beating several commercial models.
PaddleOCR 3.5: Running OCR and Document Parsing Tasks with a Transformers Backend
PaddleOCR 3.5 adds a Transformers inference backend, enabling OCR and document parsing models like PP-OCRv5 and PaddleOCR-VL 1.5 to run seamlessly within the Hugging Face ecosystem.




