🚀PP-OCRv6 is officially released !
Summary
PaddleOCR releases PP-OCRv6, a new OCR model series with sizes from 1.5M to 34.5M parameters, offering improved accuracy and faster inference, supporting 50 languages and new scenarios like PCB and CAD drawings, under Apache 2.0 open source license.
Similar Articles
@AdinaYakup: PP-OCRv6 just released by Baidu @PaddlePaddle tiny 1.5M / small 7.7M / medium 34.5M 48+ languages Supports handwritten/…
Baidu's PaddlePaddle released PP-OCRv6, an OCR model supporting 48+ languages with tiny (1.5M), small (7.7M), and medium (34.5M) sizes, optimized for edge deployment and handwritten/printed/industrial/screen/card text.
@TeksEdge: Need to OCR documents? PP-OCRv6 dropped — currently the best open-source OCR models you can download ◆︎ Fully Open Sour…
PP-OCRv6 is a new open-source OCR model series from Baidu's PaddleOCR, available in Tiny/Small/Medium sizes with excellent accuracy and speed, beating several commercial models.
PaddleOCR 3.5: Running OCR and Document Parsing Tasks with a Transformers Backend
PaddleOCR 3.5 adds a Transformers inference backend, enabling OCR and document parsing models like PP-OCRv5 and PaddleOCR-VL 1.5 to run seamlessly within the Hugging Face ecosystem.
PaddleOCR-VL-1.6: Expanding the Frontier of Document Parsing with Under-Optimized Region Refinement and Progressive Post-Training
PaddleOCR-VL-1.6 improves document parsing by identifying and refining under-optimized regions via targeted data optimization and progressive post-training, achieving state-of-the-art 96.33% on OmniDocBench v1.6.
PaddlePaddle/PaddleOCR
PaddleOCR is a powerful, lightweight OCR toolkit that converts PDFs and images into structured data for AI applications, supporting 100+ languages and designed to bridge documents with LLMs.