action-guidance

Tag

Cards List
#action-guidance

Learning Agentic Policy from Action Guidance

arXiv cs.CL · 19h ago Cached

The paper proposes ActGuide-RL, a method for training agentic policies in LLMs by using human action data as guidance to overcome exploration barriers in reinforcement learning without extensive supervised fine-tuning.

0 favorites 0 likes
← Back to home

Submit Feedback