@AdinaYakup: Unlimited-OCR New OCR from @PaddlePaddle It can parse hundreds of pages in a single pass while maintaining stable speed…
Summary
PaddlePaddle has released Unlimited-OCR, a new OCR model that uses Reference Sliding Window Attention (R-SWA) to keep the KV cache constant during decoding. The post reports 93% on OmniDocBench and a 6% improvement over a baseline that it identifies only by link.
Cached at: 06/22/26, 05:38 PM
Unlimited-OCR 🔥New OCR from @PaddlePaddle
It can parse hundreds of pages in a single pass while maintaining stable speed.
The key idea is R-SWA (Reference Sliding Window Attention), which keeps KV cache constant during decoding.
🏆 93% on OmniDocBench 📈 +6% over https://t.co/uuXPUhL22L
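The post does not say how R-SWA is implemented, so the following is only a toy sketch of the general idea of a cache that stops growing during decoding. It assumes that "reference" means a fixed set of entries that are always kept, and that everything else lives in a sliding window of recent entries. The names `BoundedKVCache` and `simulate`, and the sizes used, are hypothetical and are not taken from the Unlimited-OCR code.

```python
from collections import deque


class BoundedKVCache:
    """Toy cache: fixed reference entries plus a sliding window of recent ones.

    Assumption for illustration only; not the actual R-SWA implementation.
    """

    def __init__(self, reference, window):
        self.reference = list(reference)    # assumed to be always retained
        self.recent = deque(maxlen=window)  # oldest entry is evicted when full

    def append(self, kv):
        """Add the entry for a newly decoded token."""
        self.recent.append(kv)

    def visible(self):
        """Entries the next decoding step would attend to."""
        return self.reference + list(self.recent)

    def __len__(self):
        return len(self.reference) + len(self.recent)


def simulate(steps, n_reference=4, window=8):
    """Decode `steps` tokens and record the cache size after each step."""
    cache = BoundedKVCache([f"ref{i}" for i in range(n_reference)], window)
    sizes = []
    for t in range(steps):
        cache.append(f"tok{t}")
        sizes.append(len(cache))
    return cache, sizes


if __name__ == "__main__":
    cache, sizes = simulate(steps=100)
    print("largest cache size:", max(sizes))
    print("entries visible at the last step:", cache.visible())
```

In this sketch the cache size is capped at `n_reference + window` however many tokens are decoded, which is the property the post attributes to R-SWA. A plain cache that keeps every past token would instead grow by one entry per step.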
Similar Articles
Unlimited OCR Works
Unlimited OCR introduces Reference Sliding Window Attention to eliminate growing memory consumption in long-sequence OCR tasks, enabling efficient transcription of multiple pages in a single forward pass.
PaddleOCR-VL-1.6: Expanding the Frontier of Document Parsing with Under-Optimized Region Refinement and Progressive Post-Training
PaddleOCR-VL-1.6 improves document parsing by identifying and refining under-optimized regions via targeted data optimization and progressive post-training, achieving state-of-the-art 96.33% on OmniDocBench v1.6.
PaddlePaddle/PaddleOCR
PaddleOCR is a powerful, lightweight OCR toolkit that converts PDFs and images into structured data for AI applications, supporting 100+ languages and designed to bridge documents with LLMs.
@AdinaYakup: PP-OCRv6 just released by Baidu @PaddlePaddle tiny 1.5M / small 7.7M / medium 34.5M 48+ languages Supports handwritten/…
Baidu's PaddlePaddle released PP-OCRv6, an OCR model that supports 48+ languages and comes in three sizes: tiny (1.5M), small (7.7M), and medium (34.5M). It is optimized for edge deployment and handles handwritten, printed, industrial, screen, and card text.
@GoSailGlobal: Current OCR processes multi-page documents page by page. Every time you turn a page, memory is reset. Today, Baidu quietly open-sourced a model on GitHub and HuggingFace called Unlimited OCR, inspired by how humans copy books: - When copying a book, you don't reread hundreds of pages every time you write a word...
Baidu has open-sourced the Unlimited OCR model, which uses a Reference Sliding Window Attention (R-SWA) mechanism to parse documents with up to 32K of context in a single pass, eliminating the need for page-by-page inference.