regret-minimization

#regret-minimization

Learning in Markovian bandits with non-observable states and constrained decision epochs

arXiv cs.LG ↗ · 3d ago Cached

This paper studies regret minimization in Markovian bandits with non-observable states and constrained decision epochs, introducing a generalization called self-degrading Markovian bandits. The authors propose the UCB-NOM algorithm that achieves nearly logarithmic regret and provide bounds that do not depend on the number of states.

0 favorites 0 likes

#regret-minimization

Regret Minimization with Adaptive Opponents in Repeated Games

Hugging Face Daily Papers ↗ · 2026-06-04 Cached

This paper introduces Repeated Policy Regret (RP-Regret), a game-theoretic metric for regret minimization in repeated games with adaptive opponents, and proposes three algorithms to minimize it, showing that doing so can lead to cooperative equilibria like in Stag-Hunt.

0 favorites 0 likes

#regret-minimization

@StartupArchive_: Jeff Bezos explains how he decided to quit his job and start Amazon At 30 years old, Jeff Bezos had great Wall Street j…

X AI KOLs Following ↗ · 2026-05-17 Cached

Jeff Bezos recounts how he used a regret-minimization framework to decide to quit his job at D.E. Shaw and start Amazon, prioritizing avoiding future regret over fear of failure.

0 favorites 0 likes

regret-minimization

Learning in Markovian bandits with non-observable states and constrained decision epochs

Regret Minimization with Adaptive Opponents in Repeated Games

@StartupArchive_: Jeff Bezos explains how he decided to quit his job and start Amazon At 30 years old, Jeff Bezos had great Wall Street j…

Submit Feedback