automated-curation

#automated-curation

Mask-Proof: An LLM-based Automated Data Curation Pipeline on Mathematical Proofs

arXiv cs.AI ↗ · yesterday Cached

Introduces Mask-Proof, an LLM-based pipeline that converts mathematical proofs into masked-step tasks for automated evaluation, and presents MaskProofBench, a benchmark of 292 curated problems achieving 96.8% agreement with expert annotators.

0 favorites 0 likes

#automated-curation

MIND-Skill: Quality-Guaranteed Skill Generation via Multi-Agent Induction and Deduction

arXiv cs.AI ↗ · 2026-05-12 Cached

MIND-Skill is a new framework introduced in this research paper that automates the generation of high-quality, reusable agent skills using multi-agent induction and deduction with quality guarantees via TextGrad optimization.

0 favorites 0 likes

automated-curation

Mask-Proof: An LLM-based Automated Data Curation Pipeline on Mathematical Proofs

MIND-Skill: Quality-Guaranteed Skill Generation via Multi-Agent Induction and Deduction

Submit Feedback