DDQN
[Paper Review] Deep Reinforcement Learning with Double Q-learning (DDQN)
[1509.06461] Deep Reinforcement Learning with Double Q-learning (arxiv.org): The popular Q-learning algorithm is known to overestimate action values under certain conditions. It was not previously known whether, in practice, such overestimations are common, whether they harm performance, and whether they can generally be prevented. This pap..