Duality in Deep Reinforcement Learning--Theory
More and deeper reinforcement learning algorithms have been proposed and demonstrated on a series of decision-making domains.However,little research has been hammered at algorithm extraction.With duality in deep reinforcement learning substantially summarized,we propose a conceptually simple framework for deep reinforcement learning based on duality.Then,we propose the dual method of prioritized sampling: prioritized learning.Finally,we give the formula and analysis for the duality with priority.The algorithm implementation and experiment will be put on Part Ⅱ-Implementation.
Deep Reinforcement Learning Duality Prioritized Sampling Prioritized Learning
Jie Bai Jianfei Li Zihao Luo Yaobing Wang Li Liu
Peking University,Center of Excellence for Intelligent Robotics,Beijing Institute of Spacecraft System Engineering,Beijing,China
国际会议
西安
英文
59-66
2019-01-19(万方平台首次上网日期,不代表论文的发表时间)