reporting-obligation

#reporting-obligation

EURO-5K: When Does Domain Pretraining Matter? Benchmarking Transformers for EU Reporting Obligation Extraction

arXiv cs.CL ↗ · 2026-06-03 Cached

This paper introduces EURO-5K, a sentence-level dataset for extracting reporting obligations from EU legislation, and benchmarks discriminative and generative transformer models under full fine-tuning and parameter-efficient QLoRA. Results show that legal pretraining primarily benefits models with limited adaptation capacity, and all approaches converge around 3K samples.

0 favorites 0 likes

reporting-obligation

EURO-5K: When Does Domain Pretraining Matter? Benchmarking Transformers for EU Reporting Obligation Extraction

Submit Feedback