active-interaction


AIPO: Learning to Reason from Active Interaction

arXiv cs.CL · 2d ago

This paper introduces AIPO, a reinforcement learning framework that enhances LLM reasoning by allowing the model to actively consult collaborative agents during exploration to overcome capability boundaries.
