Tag
ElasticMem introduces a learnable latent memory mechanism for LLM agents that adaptively allocates variable budgets to retrieved memories, improving performance on memory-intensive QA and embodied agent tasks while reducing token costs.
The author reports successful experiments running MRCR v2 with 1M context length on a single MI300X using Qwen2.5-32B and FAISS, achieving competitive scores at low cost.