Tag
MÖVE is a holistic benchmark for evaluating large language models for use in the German public sector. It assesses 39 models against performance and governance criteria using ten German-language datasets.