Deep Recurrent Q-Learning for Partially Observable MDPs
[논문 리뷰] Deep Recurrent Q-Learning for Partially Observable MDPs (DRQN)
[1507.06527] Deep Recurrent Q-Learning for Partially Observable MDPs (arxiv.org) Deep Recurrent Q-Learning for Partially Observable MDPs Deep Reinforcement Learning has yielded proficient controllers for complex tasks. However, these controllers have limited memory and rely on being able to perceive the complete game screen at each decision point. To address these shortcomings, this article arxi..