Joint Agent Memory and Exploration Learning via Novelty Signals

Hugging Face Daily Papers 06/01/26, 12:00 AM Papers

agent-memory exploration novelty-driven open-ended-environments latent-memory code-coverage

Summary

This paper introduces JAMEL, a framework that jointly trains agentic memory and exploration policies using novelty signals, enabling efficient exploration in open-ended environments with reduced computational costs.

In open-ended environments, exploration is fundamental for autonomous agents, yet current language model agents struggle with this. Effective exploration requires memory, but retaining raw interaction histories is computationally expensive over long trajectories. While latent memory offers a solution to compress interaction histories, its training lacks reliable supervisory signals. We introduce Joint Agent Memory and Exploration Learning (JAMEL), a framework that trains agentic memory and exploration policy together through novelty-driven interaction. We observe that memory and exploration form a mutually dependent loop: sustained exploration requires memory to distinguish exhausted behaviors from unseen ones, while novelty-seeking interaction provides the supervision needed to make memory useful for future exploration. By utilizing deterministic and persistent novelty signals such as code coverage in the GUI domain, we provide natural, annotation-free supervision for the memory module. Empirical evaluations demonstrate that \ours successfully generalizes to unseen environments. Its exploration capability outperforms open-weight baselines and rivals the exploration depth of a closed-source model while reducing token consumption. Our code and model are open-sourced at https://github.com/MobileLLM/JAMEL.

Original Article

View Cached Full Text

Cached at: 06/02/26, 03:37 PM

Paper page - Joint Agent Memory and Exploration Learning via Novelty Signals

Source: https://huggingface.co/papers/2606.01528

Abstract

Joint Agent Memory and Exploration Learning (JAMEL) framework trains memory and exploration policies together through novelty-driven interaction, enabling effective exploration in open-ended environments with reduced computational costs.

Inopen-ended environments, exploration is fundamental for autonomous agents, yet current language model agents struggle with this. Effective exploration requires memory, but retaining raw interaction histories is computationally expensive over long trajectories. Whilelatent memoryoffers a solution to compress interaction histories, its training lacks reliable supervisory signals. We introduce JointAgent Memoryand Exploration Learning (JAMEL), a framework that trains agentic memory andexploration policytogether throughnovelty-driven interaction. We observe that memory and exploration form a mutually dependent loop: sustained exploration requires memory to distinguish exhausted behaviors from unseen ones, while novelty-seeking interaction provides the supervision needed to make memory useful for future exploration. By utilizing deterministic andpersistent novelty signalssuch ascode coveragein the GUI domain, we provide natural, annotation-free supervision for the memory module. Empirical evaluations demonstrate that \ours successfully generalizes to unseen environments. Its exploration capability outperforms open-weight baselines and rivals the exploration depth of aclosed-source modelwhile reducingtoken consumption. Our code and model are open-sourced at https://github.com/MobileLLM/JAMEL.

View arXiv page View PDF Project page GitHub3 Add to collection

Get this paper in your agent:

hf papers read 2606\.01528

Don’t have the latest CLI?curl \-LsSf https://hf\.co/cli/install\.sh \| bash

Models citing this paper0

No model linking this paper

Cite arxiv.org/abs/2606.01528 in a model README.md to link it from this page.

Datasets citing this paper0

No dataset linking this paper

Cite arxiv.org/abs/2606.01528 in a dataset README.md to link it from this page.

Spaces citing this paper0

No Space linking this paper

Cite arxiv.org/abs/2606.01528 in a Space README.md to link it from this page.

Collections including this paper0

No Collection including this paper

Add this paper to acollectionto link it from this page.

Joint Agent Memory and Exploration Learning via Novelty Signals

Paper page - Joint Agent Memory and Exploration Learning via Novelty Signals

Abstract

Models citing this paper0

Datasets citing this paper0

Spaces citing this paper0

Collections including this paper0

Similar Articles

Learning to Explore: Scaling Agentic Reasoning via Exploration-Aware Policy Optimization

AEM: Adaptive Entropy Modulation for Multi-Turn Agentic Reinforcement Learning

Some considerations on learning to explore via meta-reinforcement learning

Look Before You Leap: Autonomous Exploration for LLM Agents

Learning to Learn from Multimodal Experience

Submit Feedback

Similar Articles

Learning to Explore: Scaling Agentic Reasoning via Exploration-Aware Policy Optimization

AEM: Adaptive Entropy Modulation for Multi-Turn Agentic Reinforcement Learning

Some considerations on learning to explore via meta-reinforcement learning

Look Before You Leap: Autonomous Exploration for LLM Agents

Learning to Learn from Multimodal Experience