Jerry Liu discusses the challenges of using vision language models (VLMs) for PDF parsing: keeping the extracted text correct, preserving the proper reading order, and avoiding hallucinations.
abiruyt/text-extract-ocr is an open-source OCR model hosted on Replicate. It runs on CPU, with low cost and fast inference.
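The correctness, reading-order, and hallucination concerns raised for VLM-based PDF parsing can be made concrete with a rough sanity check. The sketch below is not taken from either item above. It assumes you already have a trusted reference transcription of the page (for example a PDF text layer or the output of a conventional OCR model) and compares a VLM's output against it. The function names `unsupported_words` and `order_similarity` are hypothetical, chosen only for illustration.

```python
import re
from collections import Counter
from difflib import SequenceMatcher


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def unsupported_words(candidate: str, reference: str) -> list[str]:
    """Return words in `candidate` that the reference does not account for.

    Each reference word can "pay for" one occurrence in the candidate;
    anything left over is a possible hallucination worth reviewing.
    """
    budget = Counter(tokenize(reference))
    extra = []
    for word in tokenize(candidate):
        if budget[word] > 0:
            budget[word] -= 1
        else:
            extra.append(word)
    return extra


def order_similarity(candidate: str, reference: str) -> float:
    """Score in [0, 1] for how closely the word sequences agree.

    1.0 means identical word order; lower values suggest reordered,
    missing, or extra text.
    """
    matcher = SequenceMatcher(
        None, tokenize(candidate), tokenize(reference), autojunk=False
    )
    return matcher.ratio()


if __name__ == "__main__":
    reference = "Revenue grew 12 percent in 2023"
    vlm_output = "Revenue grew 12 percent in 2023 and profits doubled"
    print(unsupported_words(vlm_output, reference))
    print(order_similarity(vlm_output, reference))
```

A check like this only flags candidates for review. It cannot tell whether the reference or the VLM output is the one at fault, and word-level matching will miss errors such as a changed digit inside a number that happens to appear elsewhere on the page.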