会议专题

Compound Asynchronous Exploration and Exploitation

  Data efficiency has always been a significant key topic for deep reinforcement learning.The main progress has been on sufficient exploration and effective exploitation.However,the two are often discussed separately.Profit from distributed systems,we propose an asynchronous approach to deep reinforcement learning by combining exploration and exploitation.We apply our framework to off-the-shelf deep reinforcement learning algorithms,and experimental results show that our algorithm is superior in final performance and efficiency.

Deep Reinforcement Learning Exploration and Exploitation Asynchronous Methods

Jie Bai Li Liu Yaobing Wang Haoyu Zhang Jianfei Li

Beijing Key Laboratory of Intelligent Space Robotic Systems Technology and Applications,Beijing Inst College of Engineering,Peking University,Beijing,China Science and Technology on Space Intelligent Control Laboratory,Beijing Institute of Control Engineer

国际会议

2019 2nd International Conference on Mechanical, Electronic and Engineering Technology (MEET 2019) 2019年第二届机电与工程技术国际会议

西安

英文

371-379

2019-01-19(万方平台首次上网日期,不代表论文的发表时间)