pomdp

#pomdp

The Context Gathering Decision Process: A POMDP Framework for Agentic Search

arXiv cs.AI · 3d ago

This paper introduces the Context Gathering Decision Process (CGDP), a POMDP framework for modeling how LLM agents search, and proposes interventions that improve multi-hop reasoning and reduce token usage without degrading performance.

#pomdp

Non-Myopic Active Feature Acquisition via Pathwise Policy Gradients

arXiv cs.LG · 6d ago

This paper introduces NM-PPG, a non-myopic active feature acquisition method that uses pathwise policy gradients to optimize sequential feature selection in prediction settings where acquiring features is costly.
