Tag
The article details a technique to play Atari ST music on an Amiga by emulating the YM2149 sound chip using the Amiga's PAULA chip, requiring zero CPU usage and allowing the CPU to focus on other tasks like rendering graphics.
This paper revisits the Adam optimizer for streaming reinforcement learning, demonstrating that established methods like DQN and C51 perform well when properly tuned. The authors propose Adaptive Q(lambda), which combines eligibility traces with Adam's variance adaptation to surpass existing streaming RL methods on 55 Atari games.
OpenAI presents a novel exploration strategy for deep reinforcement learning using ensembles of Q-functions with upper-confidence bounds (UCB), demonstrating significant performance improvements on the Atari benchmark.