markov-decision-processes

#markov-decision-processes

Bellman-Taylor Score Decoding for Markov Decision Processes with State-Dependent Feasible Action Sets

arXiv cs.AI ↗ · 3d ago Cached

This paper introduces Bellman-Taylor Score Decoding, a method to handle state-dependent feasible action sets in Markov decision processes, addressing a key challenge in applying deep reinforcement learning to operations research problems.

0 favorites 0 likes

#markov-decision-processes

Exact Unlearning in Reinforcement Learning

arXiv cs.LG ↗ · 2026-06-04 Cached

This paper formalizes exact unlearning in reinforcement learning, proposing a ρ-TV-stable RL algorithm for tabular MDPs that efficiently removes a user's data influence at a fraction of retraining cost, achieving near-minimax-optimal regret bounds. The work is accepted at ICML and establishes both upper and lower bounds for ρ-TV-stable RL algorithms.

0 favorites 0 likes

#markov-decision-processes

Answer-Set-Programming-based Abstractions for Reinforcement Learning

arXiv cs.AI ↗ · 2026-06-01 Cached

This paper presents an Answer Set Programming (ASP) based implementation of the CARCASS framework for constructing abstractions in reinforcement learning, demonstrating its effectiveness on Blocks World and Minigrid domains.

0 favorites 0 likes

#markov-decision-processes

Evolving Robustness--Exploration Trade-off in Online Reinforcement Learning via Quantile Bayesian Risk MDPs

arXiv cs.LG ↗ · 2026-05-26 Cached

This paper proposes a quantile Bayesian risk-aware MDP framework for online RL that adaptively balances robustness and exploration over time, providing theoretical regret bounds and demonstrating strong empirical performance.

0 favorites 0 likes

#markov-decision-processes

@RohOnChain: This 1 hour Stanford lecture on Markov Decision Processes will teach you more about the math behind systematic trading …

X AI KOLs Timeline ↗ · 2026-05-12 Cached

The article promotes a Stanford lecture on Markov Decision Processes as a valuable resource for understanding the mathematical foundations of systematic trading, claiming it offers more insight than a short-term internship at major financial firms.

0 favorites 0 likes

markov-decision-processes

Bellman-Taylor Score Decoding for Markov Decision Processes with State-Dependent Feasible Action Sets

Exact Unlearning in Reinforcement Learning

Answer-Set-Programming-based Abstractions for Reinforcement Learning

Evolving Robustness--Exploration Trade-off in Online Reinforcement Learning via Quantile Bayesian Risk MDPs

@RohOnChain: This 1 hour Stanford lecture on Markov Decision Processes will teach you more about the math behind systematic trading …

Submit Feedback