@thesupermanmx: China just open-sourced an OCR model that transcribes an entire book in a single pass. It's called Unlimited OCR. Built…
Summary
Baidu has open-sourced Unlimited OCR, an OCR model built on DeepSeek OCR that can transcribe entire books in a single pass with constant memory usage, scoring 93% on a standard parsing benchmark (+6 over the baseline).
Cached at: 06/28/26, 08:04 AM
China just open-sourced an OCR model that transcribes an entire book in a single pass.
It’s called Unlimited OCR. Built on DeepSeek OCR with one key fix to attention, so memory stays flat no matter how long the document.
→ 93% on standard parsing benchmark (+6 over baseline)
→ <0.11 error rate at 40+ pages
→ Dozens of pages per pass
→ Steady speed where the old one slows 35%
100% Open Source.
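The flat-memory claim can be illustrated with a toy sketch. The post does not describe the attention change itself, so everything here is an assumption made for illustration: the decoder is assumed to keep a fixed set of "reference" entries plus a rolling window of the most recent tokens, and the names `cache_sizes`, `window`, and `num_reference` are hypothetical. This is not the released model's code.

```python
from collections import deque


def cache_sizes(num_tokens, window, num_reference):
    """Count cached key/value entries at each decoding step.

    Assumption (not from the post): the first `num_reference` entries are
    always kept, and only the latest `window` tokens are kept besides them.
    """
    reference = list(range(num_reference))  # hypothetical always-kept entries
    recent = deque(maxlen=window)           # oldest entry is dropped when full
    sizes = []
    for token_id in range(num_reference, num_tokens):
        recent.append(token_id)
        sizes.append(len(reference) + len(recent))
    return sizes


def full_cache_sizes(num_tokens):
    """Full attention for comparison: the cache grows by one entry per token."""
    return list(range(1, num_tokens + 1))


if __name__ == "__main__":
    windowed = cache_sizes(num_tokens=1000, window=64, num_reference=16)
    full = full_cache_sizes(1000)
    print("windowed cache, final size:", windowed[-1])
    print("full cache, final size:", full[-1])
```

Under these assumptions the windowed cache stops growing once the window fills, while a full-attention cache keeps growing with document length. That is the general reason a windowed scheme can hold memory steady on long inputs.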
Similar Articles
@GoSailGlobal: Current OCR processes multi-page documents page by page. Every time you turn a page, memory is reset. Today, Baidu quietly open-sourced a model on GitHub and HuggingFace called Unlimited OCR, inspired by how humans copy books: - When copying a book, you don't reread hundreds of pages every time you write a word...
Baidu has open-sourced the Unlimited OCR model, which uses a Reference Sliding Window Attention (R-SWA) mechanism to parse documents with a context of up to 32K in a single pass, eliminating the need for page-by-page inference.
@BaiduAI_News: We’re open-sourcing Unlimited OCR — built to read long documents in one pass. With 3B total parameters and only 500M ac…
Baidu open-sources Unlimited OCR, a 3B parameter model (500M activated) that reads long documents in a single pass using Reference Sliding Window Attention (R-SWA), achieving state-of-the-art results on OmniDocBench.
Unlimited OCR: One-Shot Long-Horizon Parsing
Baidu releases Unlimited-OCR, an open-source model for one-shot long-horizon document parsing, built on DeepSeek-OCR, with support for single images, multi-page documents, and PDFs.
@_akhaliq: Baidu just released Unlimited-OCR
Baidu has released Unlimited-OCR, an open-source optical character recognition model for one-shot long-horizon document parsing.
baidu/Unlimited-OCR
Baidu releases Unlimited-OCR, a new model for one-shot long-horizon document parsing, built on DeepSeek-OCR. It supports single-image and multi-page/PDF parsing via Hugging Face Transformers and SGLang.