alfworld

Tag

Cards List
#alfworld

SkillDAG: Self-Evolving Typed Skill Graphs for LLM Skill Selection at Scale

arXiv cs.AI · yesterday Cached

Introduces SkillDAG, a self-evolving typed directed graph for LLM skill selection at scale that models inter-skill relationships and allows agents to query and evolve the graph during execution, outperforming baselines on ALFWorld and SkillsBench.

0 favorites 0 likes
#alfworld

What and When to Distill: Selective Hindsight Distillation for Multi-Turn Agents

arXiv cs.AI · 2026-05-20 Cached

This paper presents the first systematic study of credit assignment in multi-turn LLM agents, introducing SERL, a selective environment-reweighted learning framework. SERL uses environment feedback to sharpen the RL objective on causally relevant actions, achieving 90.0% and 80.1% success rates on ALFWorld and WebShop respectively.

0 favorites 0 likes
← Back to home

Submit Feedback