@thesupermanmx: China just open-sourced an OCR model that transcribes an entire book in a single pass. It's called Unlimited OCR. Built…

X AI KOLs Timeline Models

Summary

China open-sourced Unlimited OCR, an OCR model based on DeepSeek OCR that can transcribe entire books in a single pass with constant memory usage, achieving 93% on standard parsing benchmarks.

China just open-sourced an OCR model that transcribes an entire book in a single pass. It's called Unlimited OCR. Built on DeepSeek OCR with one key fix to attention, so memory stays flat no matter how long the document. → 93% on standard parsing benchmark (+6 over baseline) → <0.11 error rate at 40+ pages → Dozens of pages per pass → Steady speed where the old one slows 35% 100% Open Source.
Original Article
View Cached Full Text

Cached at: 06/28/26, 08:04 AM

China just open-sourced an OCR model that transcribes an entire book in a single pass.

It’s called Unlimited OCR. Built on DeepSeek OCR with one key fix to attention, so memory stays flat no matter how long the document.

→ 93% on standard parsing benchmark (+6 over baseline) → <0.11 error rate at 40+ pages → Dozens of pages per pass → Steady speed where the old one slows 35%

100% Open Source.

Similar Articles

@GoSailGlobal: Current OCR processes multi-page documents page by page. Every time you turn a page, memory is reset. Today, Baidu quietly open-sourced a model on GitHub and HuggingFace called Unlimited OCR, inspired by how humans copy books: - When copying a book, you don't reread hundreds of pages every time you write a word...

X AI KOLs Timeline

Baidu has open-sourced the Unlimited OCR model, which uses a Reference Sliding Window Attention (R-SWA) mechanism to parse documents up to 32K context in a single pass, eliminating the need for page-by-page inference.

Unlimited OCR: One-Shot Long-Horizon Parsing

Hacker News Top

Baidu releases Unlimited-OCR, an open-source model for one-shot long-horizon document parsing, building upon Deepseek-OCR with support for single images, multi-page documents, and PDFs.

baidu/Unlimited-OCR

Hugging Face Models Trending

Baidu releases Unlimited-OCR, a new model for one-shot long-horizon document parsing, building on Deepseek-OCR. It supports single image and multi-page/PDF parsing via Hugging Face Transformers and SGLang.