Tag
Surya OCR is a state-of-the-art open-source OCR model with less than 1B parameters, supporting 91 languages and achieving top benchmark scores under 3B params.
This paper identifies Footprint Bias in document layout analysis robustness evaluation and proposes a structure-aware auditing framework that decouples probe construction and pathway attribution, showing that small structurally targeted probes cause comparable downstream degradation to larger perturbations.