partial-observability

Tag

Cards List
#partial-observability

Belief Memory: Agent Memory Under Partial Observability

arXiv cs.AI · 2d ago Cached

This paper introduces BeliefMem, a novel memory paradigm for LLM agents that stores multiple candidate conclusions with probabilities to handle partial observability and reduce self-reinforcing errors. Empirical evaluations show it outperforms deterministic baselines on LoCoMo and ALFWorld benchmarks.

0 favorites 0 likes
#partial-observability

Neural Co-state Policies: Structuring Hidden States in Recurrent Reinforcement Learning

arXiv cs.LG · 2d ago Cached

This paper introduces Neural Co-state Policies, establishing a formal link between recurrent reinforcement learning hidden states and the Pontryagin minimum principle to improve interpretability and robustness.

0 favorites 0 likes
← Back to home

Submit Feedback