alfworld

#alfworld

SkillDAG: Self-Evolving Typed Skill Graphs for LLM Skill Selection at Scale

arXiv cs.AI ↗ · yesterday Cached

Introduces SkillDAG, a self-evolving typed directed graph for LLM skill selection at scale that models inter-skill relationships and allows agents to query and evolve the graph during execution, outperforming baselines on ALFWorld and SkillsBench.

0 favorites 0 likes

#alfworld

What and When to Distill: Selective Hindsight Distillation for Multi-Turn Agents

arXiv cs.AI ↗ · 2026-05-20 Cached

This paper presents the first systematic study of credit assignment in multi-turn LLM agents, introducing SERL, a selective environment-reweighted learning framework. SERL uses environment feedback to sharpen the RL objective on causally relevant actions, achieving 90.0% and 80.1% success rates on ALFWorld and WebShop respectively.

0 favorites 0 likes

alfworld

SkillDAG: Self-Evolving Typed Skill Graphs for LLM Skill Selection at Scale

What and When to Distill: Selective Hindsight Distillation for Multi-Turn Agents

Submit Feedback