dialogue-agents

#dialogue-agents

CoreMem: Riemannian Retrieval and Fisher-Guided Distillation for Long-Term Memory in Dialogue Agents

arXiv cs.CL ↗ · 2026-06-18 Cached

CoreMem proposes a resource-efficient edge-cloud memory architecture for dialogue agents, using Riemannian retrieval with a Fisher-Rao metric and Fisher-guided discrete token distillation to achieve strong accuracy improvements within an 8 GB VRAM budget.

0 favorites 0 likes

#dialogue-agents

G-Long: Graph-Enhanced Memory Management for Efficient Long-Term Dialogue Agents

arXiv cs.CL ↗ · 2026-06-12 Cached

G-Long proposes a graph-enhanced memory management framework for long-term dialogue agents, using a fine-tuned small language model for structured triplet extraction and associative retrieval, achieving state-of-the-art performance in response generation and memory retrieval with reduced computational overhead.

0 favorites 0 likes

#dialogue-agents

From Static Context to Calibrated Interactive RL: Mitigating Distribution Shift in Multi-turn Dialogue with Aligned Simulator

arXiv cs.AI ↗ · 2026-05-27 Cached

This paper theoretically identifies and mitigates context distribution shift in multi-turn dialogue RL, proposing Calibrated Interactive RL that couples interactive RL with simulator alignment to reduce the sim-to-real gap and achieve state-of-the-art performance.

0 favorites 0 likes

#dialogue-agents

SAVOIR: Learning Social Savoir-Faire via Shapley-based Reward Attribution

Hugging Face Daily Papers ↗ · 2026-04-21 Cached

SAVOIR framework applies cooperative game theory and Shapley values to train language agents with improved social intelligence, achieving SOTA on SOTOPIA benchmark and matching GPT-4o performance.

0 favorites 0 likes

dialogue-agents

CoreMem: Riemannian Retrieval and Fisher-Guided Distillation for Long-Term Memory in Dialogue Agents

G-Long: Graph-Enhanced Memory Management for Efficient Long-Term Dialogue Agents

From Static Context to Calibrated Interactive RL: Mitigating Distribution Shift in Multi-turn Dialogue with Aligned Simulator

SAVOIR: Learning Social Savoir-Faire via Shapley-based Reward Attribution

Submit Feedback