document-classification

#document-classification

Revising RVL-CDIP: Quantifying Errors and Test-Train Overlap

arXiv cs.CL ↗ · yesterday Cached

This paper identifies and corrects label errors and test-train overlap in the RVL-CDIP document classification dataset, finding 12% label errors and 35% duplication. Correction improves classification accuracy and out-of-distribution generalization.

0 favorites 0 likes

#document-classification

Enhancing BiGRU with a KAN Block for Legal Document Classification and Summarization

arXiv cs.CL ↗ · 2026-06-02 Cached

This paper introduces a KAN-enhanced BiGRU architecture for classifying and summarizing multilingual legal documents from Bangladesh, achieving modest accuracy and ROUGE scores and demonstrating that the KAN block improves classification accuracy over the baseline BiGRU.

0 favorites 0 likes

#document-classification