Tag
This paper introduces EURO-5K, a sentence-level dataset for extracting reporting obligations from EU legislation, and benchmarks discriminative and generative transformer models under full fine-tuning and parameter-efficient QLoRA. Results show that legal pretraining primarily benefits models with limited adaptation capacity, and all approaches converge around 3K samples.