skill-learning

Tag

Cards List
#skill-learning

Online Skill Learning for Web Agents via State-Grounded Dynamic Retrieval

arXiv cs.AI · yesterday Cached

This paper proposes SGDR (State-Grounded Dynamic Retrieval), an online skill learning method for web agents that enables stepwise, state-aware skill reuse rather than static task-level retrieval. Experiments on WebArena show SGDR achieves 37.5% success rate with GPT-4.1, a ~10.6% relative gain over strong baselines.

0 favorites 0 likes
#skill-learning

@rohanpaul_ai: AI agents should treat memory as a changing web of useful connections, not static storage. Most agent memory systems re…

X AI KOLs Timeline · 2d ago Cached

该论文提出 FluxMem,一种将智能体记忆视为不断演化的图结构,通过动态修复连接和提炼技能来提升记忆效果的系统。实验显示其在多个任务上优于现有方法,例如在 LoCoMo 上达到 95.06% 准确率,并在 GAIA 上相比 Kimi K2 提升 12.73 分。

0 favorites 0 likes
#skill-learning

MMG2Skill: Can Agents Distill In-the-Wild Guides into Self-Evolving Skills?

Hugging Face Daily Papers · 4d ago Cached

MMG2Skill converts web-based procedural guides into executable skills for agents through closed-loop learning, improving performance across GUI control, gameplay, and card play tasks with macro-average gains of +12.8 to +25.3 percentage points.

0 favorites 0 likes
#skill-learning

From History to State: Constant-Context Skill Learning for LLM Agents

arXiv cs.AI · 2026-05-08 Cached

This paper introduces 'constant-context skill learning,' a framework that moves procedural knowledge from prompts into model weights to reduce token usage and improve privacy for LLM agents. The method achieves strong performance on benchmarks like ALFWorld and WebShop while significantly reducing inference costs.

0 favorites 0 likes
#skill-learning

Stochastic Neural Networks for hierarchical reinforcement learning

OpenAI Blog · 2017-04-10 Cached

OpenAI researchers propose a framework using stochastic neural networks for hierarchical reinforcement learning that pre-trains useful skills guided by a proxy reward, then leverages these skills for faster learning in downstream tasks with sparse rewards or long horizons.

0 favorites 0 likes
← Back to home

Submit Feedback