@thesupermanmx: China just open-sourced an OCR model that transcribes an entire book in a single pass. It's called Unlimited OCR. Built…

X AI KOLs Timeline 06/27/26, 12:11 PM Models

ocr open-source deepseek-ocr document-parsing memory-efficient ai-model

Summary

China open-sourced Unlimited OCR, an OCR model based on DeepSeek OCR that can transcribe entire books in a single pass with constant memory usage, achieving 93% on standard parsing benchmarks.

China just open-sourced an OCR model that transcribes an entire book in a single pass. It's called Unlimited OCR. Built on DeepSeek OCR with one key fix to attention, so memory stays flat no matter how long the document. → 93% on standard parsing benchmark (+6 over baseline) → <0.11 error rate at 40+ pages → Dozens of pages per pass → Steady speed where the old one slows 35% 100% Open Source.

Original Article

View Cached Full Text

Cached at: 06/28/26, 08:04 AM

China just open-sourced an OCR model that transcribes an entire book in a single pass.

It’s called Unlimited OCR. Built on DeepSeek OCR with one key fix to attention, so memory stays flat no matter how long the document.

→ 93% on standard parsing benchmark (+6 over baseline) → <0.11 error rate at 40+ pages → Dozens of pages per pass → Steady speed where the old one slows 35%

100% Open Source.

Similar Articles

@GoSailGlobal: Current OCR processes multi-page documents page by page. Every time you turn a page, memory is reset. Today, Baidu quietly open-sourced a model on GitHub and HuggingFace called Unlimited OCR, inspired by how humans copy books: - When copying a book, you don't reread hundreds of pages every time you write a word...

X AI KOLs Timeline

Baidu has open-sourced the Unlimited OCR model, which uses a Reference Sliding Window Attention (R-SWA) mechanism to parse documents up to 32K context in a single pass, eliminating the need for page-by-page inference.

@BaiduAI_News: We’re open-sourcing Unlimited OCR — built to read long documents in one pass. With 3B total parameters and only 500M ac…

X AI KOLs Timeline

Baidu open-sources Unlimited OCR, a 3B parameter model (500M activated) that reads long documents in a single pass using Reference Sliding Window Attention (R-SWA), achieving state-of-the-art results on OmniDocBench.

@thesupermanmx: China just open-sourced an OCR model that transcribes an entire book in a single pass. It's called Unlimited OCR. Built…

Similar Articles

@BaiduAI_News: We’re open-sourcing Unlimited OCR — built to read long documents in one pass. With 3B total parameters and only 500M ac…

Unlimited OCR: One-Shot Long-Horizon Parsing

@_akhaliq: Baidu just released Unlimited-OCR

baidu/Unlimited-OCR

Submit Feedback