magistrate-tasks

Magis-Bench: Evaluating LLMs on Magistrate-Level Legal Tasks

arXiv cs.CL · 3 days ago

This article introduces Magis-Bench, a benchmark that evaluates large language models on magistrate-level legal tasks, such as judicial reasoning and sentence drafting, using data from Brazilian judicial exams.
