Baidu's PaddlePaddle released PP-OCRv6, an OCR model that supports 48+ languages and comes in tiny (1.5M), small (7.7M), and medium (34.5M) sizes. It is optimized for edge deployment and handles handwritten, printed, industrial, screen, and card text.
The article details how Baidu owns four of the five layers in the AI race (chips, cloud, models, and apps), as showcased at the Baidu Create event, and highlights its resilience against export bans and an ecosystem built over a decade.
PaddleOCR-VL is a compact 0.9B vision-language model that achieves state-of-the-art performance in multilingual document parsing and element recognition by integrating NaViT-style dynamic resolution with the ERNIE language model.