failure-driven-rl

SENTINEL: Failure-Driven Reinforcement Learning for Training Tool-Using Language Model Agents

arXiv cs.CL · 2026-06-12

This paper introduces SENTINEL, a failure-driven reinforcement learning framework for training tool-using language model agents. A Controller-Proposer-Solver loop turns failed trajectories into targeted training tasks, which the paper reports improves benchmark performance.
