self-supervision

#self-supervision

The Path Matters: Learning a Token-Commitment Policy for Diffusion Language Models

arXiv cs.CL · 2026-05-26

This paper introduces TraceLock, a lightweight plug-in controller that learns a token-commitment policy for frozen diffusion language models, improving the quality-step tradeoff across various tasks without retraining.

#self-supervision

EVE-Agent: Evidence-Verifiable Self-Evolving Agents

arXiv cs.AI · 2026-05-25

EVE-Agent introduces a framework for self-evolving search agents that ensures evidence verifiability: the agents generate questions, answers, and evidence spans, and are trained on the marginal accuracy gain that the evidence provides. This improves grounded correctness without human annotations.
