markov-decision-processes

#markov-decision-processes

Performance-Driven Environment Abstraction with Multi-Timescale Learning

arXiv cs.LG ↗ · 2026-06-17 Cached

This paper proposes a performance-driven state abstraction method for reinforcement learning that directly optimizes decision quality, using a multi-timescale framework to jointly adapt the policy and a tree-structured abstraction. The algorithm refines or aggregates state space based on Q-value discrepancies, achieving better sample efficiency and faster replanning than baselines.

0 favorites 0 likes

#markov-decision-processes

Lyapunov-Based Sample Complexity Analysis for Weakly-Coupled MDPs

arXiv cs.LG ↗ · 2026-06-15 Cached

This paper studies the sample complexity of learning in average-reward weakly-coupled MDPs and restless bandits, establishing finite-sample PAC guarantees with polynomial complexity using a novel Lyapunov-based analysis framework.

0 favorites 0 likes

#markov-decision-processes

Bellman-Taylor Score Decoding for Markov Decision Processes with State-Dependent Feasible Action Sets

arXiv cs.AI ↗ · 2026-06-10 Cached

This paper introduces Bellman-Taylor Score Decoding, a method to handle state-dependent feasible action sets in Markov decision processes, addressing a key challenge in applying deep reinforcement learning to operations research problems.

0 favorites 0 likes

#markov-decision-processes

Exact Unlearning in Reinforcement Learning

arXiv cs.LG ↗ · 2026-06-04 Cached

This paper formalizes exact unlearning in reinforcement learning, proposing a ρ-TV-stable RL algorithm for tabular MDPs that efficiently removes a user's data influence at a fraction of retraining cost, achieving near-minimax-optimal regret bounds. The work is accepted at ICML and establishes both upper and lower bounds for ρ-TV-stable RL algorithms.

0 favorites 0 likes

#markov-decision-processes

Answer-Set-Programming-based Abstractions for Reinforcement Learning

arXiv cs.AI ↗ · 2026-06-01 Cached

This paper presents an Answer Set Programming (ASP) based implementation of the CARCASS framework for constructing abstractions in reinforcement learning, demonstrating its effectiveness on Blocks World and Minigrid domains.

0 favorites 0 likes

#markov-decision-processes

Evolving Robustness--Exploration Trade-off in Online Reinforcement Learning via Quantile Bayesian Risk MDPs

arXiv cs.LG ↗ · 2026-05-26 Cached

This paper proposes a quantile Bayesian risk-aware MDP framework for online RL that adaptively balances robustness and exploration over time, providing theoretical regret bounds and demonstrating strong empirical performance.

0 favorites 0 likes

#markov-decision-processes

@RohOnChain: This 1 hour Stanford lecture on Markov Decision Processes will teach you more about the math behind systematic trading …

X AI KOLs Timeline ↗ · 2026-05-12 Cached

The article promotes a Stanford lecture on Markov Decision Processes as a valuable resource for understanding the mathematical foundations of systematic trading, claiming it offers more insight than a short-term internship at major financial firms.

0 favorites 0 likes

markov-decision-processes

Performance-Driven Environment Abstraction with Multi-Timescale Learning

Lyapunov-Based Sample Complexity Analysis for Weakly-Coupled MDPs

Bellman-Taylor Score Decoding for Markov Decision Processes with State-Dependent Feasible Action Sets

Exact Unlearning in Reinforcement Learning

Answer-Set-Programming-based Abstractions for Reinforcement Learning

Evolving Robustness--Exploration Trade-off in Online Reinforcement Learning via Quantile Bayesian Risk MDPs

@RohOnChain: This 1 hour Stanford lecture on Markov Decision Processes will teach you more about the math behind systematic trading …

Submit Feedback