A Pursuit-Evasion Algorithm Based on Hierarchical Reinforcement Learning
This paper proposed a pursuit-evasion algorithm based on the Option method from hierarchical reinforcement learning and applied it into multi-robot pursuitevasion game in 2D-Dynamic environment. The algorithm efficiency is studied by comparing it with Q-learning. We decompose the complex task with option method, and divide the learning process into two parts: High-level learning and Lowlevel learning, then design a new mechanism in order to make the learning process perform parallel. The simulation result shows the Option algorithm can efficiently reduce the complexity of pursuit-evasion task, avoid traditional reinforcement learning curse of dimensionality, and improve the learning result.
Pursuit Evasion Problem Reinforcement Learning
Jie Liu Shuhua Liu Hongyan Wu Yu Zhang
Computer Science School Northeast Normal University Changchun,China,130117
国际会议
长沙
英文
1426-1430
2009-04-11(万方平台首次上网日期,不代表论文的发表时间)