@oliviscusAI: You can now parse any document with one 1.7B parameter model It’s called dots-ocr. One system that handles text, tables…

X AI KOLs Timeline Models

Summary

The article introduces dots-ocr, a 1.7B parameter model capable of parsing text, tables, formulas, and images from documents in over 100 languages without needing separate OCR pipelines.

You can now parse any document with one 1.7B parameter model 🤯 It’s called dots-ocr. One system that handles text, tables, formulas, images, and PDFs across 100+ languages. No separate OCR pipeline. No task-specific models. https://t.co/KTK8GrZ9hf
Original Article
View Cached Full Text

Cached at: 05/13/26, 10:18 AM

You can now parse any document with one 1.7B parameter model 🤯

It’s called dots-ocr. One system that handles text, tables, formulas, images, and PDFs across 100+ languages.

No separate OCR pipeline. No task-specific models. https://t.co/KTK8GrZ9hf

Similar Articles

Unlimited OCR: One-Shot Long-Horizon Parsing

Hacker News Top

Baidu releases Unlimited-OCR, an open-source model for one-shot long-horizon document parsing, building upon Deepseek-OCR with support for single images, multi-page documents, and PDFs.

dots.ocr: Multilingual Document Layout Parsing in a Single Vision-Language Model

Papers with Code Trending

This paper presents dots.ocr, a unified Vision-Language Model that jointly learns layout detection, text recognition, and relational understanding for multilingual document layout parsing. It achieves state-of-the-art results on OmniDocBench and introduces the XDocParse benchmark spanning 126 languages.

baidu/Unlimited-OCR

Hugging Face Models Trending

Baidu releases Unlimited-OCR, a new model for one-shot long-horizon document parsing, building on Deepseek-OCR. It supports single image and multi-page/PDF parsing via Hugging Face Transformers and SGLang.