Tag
InfoMem introduces a reward mechanism for training chunk-wise memory agents. The reward scores the utility of the agent's final memory by its answer-conditioned information gain, and it improves long-context memory-agent performance under the same RL framework.
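
The summary does not give the reward formula, so the following is only a minimal sketch under one assumed reading of "answer-conditioned information gain": how much the final memory raises the log-likelihood of the gold answer compared with answering without any memory. The function names and the toy per-token probabilities are hypothetical and are not taken from InfoMem.

```python
import math
from typing import Sequence


def sequence_log_prob(token_probs: Sequence[float]) -> float:
    """Sum the log-probabilities a model assigns to each gold-answer token."""
    return sum(math.log(p) for p in token_probs)


def information_gain_reward(
    probs_with_memory: Sequence[float],
    probs_without_memory: Sequence[float],
) -> float:
    """Assumed reward: log p(answer | question, memory) - log p(answer | question).

    Positive when the final memory makes the gold answer more likely,
    negative when it makes the answer less likely, zero when it changes nothing.
    """
    return sequence_log_prob(probs_with_memory) - sequence_log_prob(probs_without_memory)


# Toy per-token probabilities for a two-token gold answer (illustrative only).
helpful = information_gain_reward([0.5, 0.5], [0.25, 0.25])
harmful = information_gain_reward([0.1, 0.1], [0.25, 0.25])
```

In this sketch the reward depends only on the final memory and the gold answer, so it could replace a sparse answer-correctness reward without changing the rest of an RL training loop. Whether InfoMem uses this exact form is not stated in the summary.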