@KaichaoYou: a great ocr model from baidu! you would be surprised if you see how popular ocr models are, they are sometimes even mo…
Summary
Baidu's Unlimited-OCR model, using Reference Sliding Window Attention, is now supported in vLLM, enabling efficient one-shot parsing of entire books with constant memory usage.
Cached at: 06/29/26, 02:22 AM
a great ocr model from baidu! you would be surprised if you see how popular ocr models are, they are sometimes even more popular than top-tier LLMs!😜
vLLM (@vllm_project): 🎉 Unlimited-OCR from @Baidu_Inc now runs in vLLM. One-shot parsing of entire books with constant KV cache, powered by Reference Sliding Window Attention (R-SWA).
🧠 R-SWA keeps KV cache fixed throughout decoding — no memory blowup, no slowdown, no matter how long the output
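To make the "fixed KV cache" claim concrete, here is a toy sketch of a cache that holds a pinned set of reference entries plus a bounded window of recent entries. This is an illustration only, not Baidu's or vLLM's implementation: the class name `ReferenceSlidingWindowCache`, the idea that the "reference" part corresponds to pinned input tokens, and the window size are all assumptions made for the example.

```python
from collections import deque


class ReferenceSlidingWindowCache:
    """Toy cache: pinned reference entries plus a bounded window of recent entries.

    Hypothetical illustration; the real R-SWA mechanism may differ.
    """

    def __init__(self, reference, window):
        # Assumption: reference entries (e.g. document/image tokens) are never evicted.
        self.reference = list(reference)
        # A deque with maxlen drops its oldest item once the window is full.
        self.recent = deque(maxlen=window)

    def append(self, entry):
        """Record one newly decoded entry, evicting the oldest if the window is full."""
        self.recent.append(entry)

    def visible(self):
        """Entries a new token could attend to: reference first, then the recent window."""
        return self.reference + list(self.recent)

    def __len__(self):
        return len(self.reference) + len(self.recent)


cache = ReferenceSlidingWindowCache(reference=["img0", "img1", "img2"], window=4)
sizes = []
for step in range(10):
    cache.append(f"tok{step}")
    sizes.append(len(cache))

# The cache size grows until the window fills, then stays flat.
print(sizes)
print(cache.visible())
```

In this sketch the cache size is bounded by `len(reference) + window` regardless of how many tokens are decoded, which is the property the post describes as "no memory blowup, no matter how long the output".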
Similar Articles
@BaiduAI_News: We’re open-sourcing Unlimited OCR — built to read long documents in one pass. With 3B total parameters and only 500M ac…
Baidu open-sources Unlimited OCR, a 3B parameter model (500M activated) that reads long documents in a single pass using Reference Sliding Window Attention (R-SWA), achieving state-of-the-art results on OmniDocBench.
baidu/Unlimited-OCR
Baidu releases Unlimited-OCR, a new model for one-shot long-horizon document parsing, building on DeepSeek-OCR. It supports single image and multi-page/PDF parsing via Hugging Face Transformers and SGLang.
Unlimited OCR: One-Shot Long-Horizon Parsing
Baidu releases Unlimited-OCR, an open-source model for one-shot long-horizon document parsing, building upon DeepSeek-OCR with support for single images, multi-page documents, and PDFs.
@thesupermanmx: China just open-sourced an OCR model that transcribes an entire book in a single pass. It's called Unlimited OCR. Built…
China open-sourced Unlimited OCR, an OCR model based on DeepSeek OCR that can transcribe entire books in a single pass with constant memory usage, achieving 93% on standard parsing benchmarks.
@vanstriendaniel: It's raining OCR models again! @Baidu_Inc's Unlimited-OCR is one of the more interesting. You can try it without much e…
This post shows how to serve Baidu's Unlimited-OCR model as a temporary, OpenAI-compatible endpoint on Hugging Face Jobs, enabling multi-page document parsing with features like table-to-HTML and equation-to-LaTeX extraction.