🚀PP-OCRv6 is officially released !

Reddit r/LocalLLaMA 06/12/26, 08:06 AM Models

ocr paddleocr open-source edge-ai multi-language model-release

Summary

PaddleOCR releases PP-OCRv6, a new OCR model series with sizes from 1.5M to 34.5M parameters, offering improved accuracy and faster inference, supporting 50 languages and new scenarios like PCB and CAD drawings, under Apache 2.0 open source license.

🔥PaddleOCR’s new OCR model series scales from 1.5M to 34.5M parameters, bringing stronger accuracy, faster inference, and broader deployment options — from browsers and edge devices to servers. 📊What’s new: 🔸Tiny / Small / Medium models: 1.5M, 7.7M, 34.5M params 🔸+4.9% detection accuracy and +5.1% recognition accuracy over PP-OCRv5 🔸Up to 5.2× faster CPU inference with OpenVINO 🔸50 languages in one unified model 🔸New scenarios: PCB, CAD drawings, digital tubes, dot-matrix text 🔸Apache 2.0 open source ✨Lightweight OCR, built for the AI data era. 🔗Try it: 🌐 https://paddleocr.com 💻 https://github.com/PaddlePaddle/P addleOCR 🤗https://huggingface.co/collections/Pa ddlePaddle/pp-ocrv6

Original Article

Similar Articles

@AdinaYakup: PP-OCRv6 just released by Baidu @PaddlePaddle tiny 1.5M / small 7.7M / medium 34.5M 48+ languages Supports handwritten/…

X AI KOLs Following

Baidu's PaddlePaddle released PP-OCRv6, an OCR model supporting 48+ languages with tiny (1.5M), small (7.7M), and medium (34.5M) sizes, optimized for edge deployment and handwritten/printed/industrial/screen/card text.

@TeksEdge: Need to OCR documents? PP-OCRv6 dropped — currently the best open-source OCR models you can download ◆︎ Fully Open Sour…

X AI KOLs Timeline

PP-OCRv6 is a new open-source OCR model series from Baidu's PaddleOCR, available in Tiny/Small/Medium sizes with excellent accuracy and speed, beating several commercial models.

PaddleOCR 3.5: Running OCR and Document Parsing Tasks with a Transformers Backend

Hugging Face Blog

PaddleOCR 3.5 adds a Transformers inference backend, enabling OCR and document parsing models like PP-OCRv5 and PaddleOCR-VL 1.5 to run seamlessly within the Hugging Face ecosystem.

PaddleOCR-VL-1.6: Expanding the Frontier of Document Parsing with Under-Optimized Region Refinement and Progressive Post-Training

Hugging Face Daily Papers

PaddleOCR-VL-1.6 improves document parsing by identifying and refining under-optimized regions via targeted data optimization and progressive post-training, achieving state-of-the-art 96.33% on OmniDocBench v1.6.

PaddlePaddle/PaddleOCR

GitHub Trending (daily)

PaddleOCR is a powerful, lightweight OCR toolkit that converts PDFs and images into structured data for AI applications, supporting 100+ languages and designed to bridge documents with LLMs.