
APPO: Agentic Procedural Policy Optimization

Hugging Face Daily Papers · 5d ago

APPO improves multi-turn tool use in LLM agents by refining branching decisions and credit assignment through fine-grained decision points and procedure-level advantage scaling, outperforming baselines by 4 points on 13 benchmarks.
