Q-Learning Based Method of Adaptive Path Planning for Mobile Robot

摘要：

Reinforcement learning (RL) is a learning technique based on trial and error. Q-learning is a method of RL algorithms. It has been applied widely in the adaptive path planning for the autonomous mobile robot. In order to decrease the learning space and increase the learning convergent speed, this paper adopts Q-layered learning method to divide the task of searching optimal path into three basic behaviors (or subtasks), namely static obstacle-avoidance, dynamic obstacle-avoidance and goal approaching. Especially in the learning for the static obstacle-avoidance behavior, a novel priority Q search method (PQA) is used to avoid the blindly search of the random search algorithm (RA) which is always used to select actions in Q-learning. PQA uses the sum of weighted vectors pointing away from obstacles to predict the magnitude of the reinforcement reward receiving from the possible state-action after executing the action. Robot controller will select an action based on the result at the next executing time. At last PQA and RA are both simulated in two different environments. The learning results show that learn steps are fewer by PQA than by RA under same environment to achieve the task. And in the total learning periods PQA has the higher task complete percent. PQA is an effective way to solve the problem of the path planning under dynamic and unknown environment.

关键词： Q-learning adaptive path planning mobile robot PQA RA

作者: Yibin Li Caihong Li Zijian Zhang

作者单位: School of Electrical and Automation Engineering Tianjin University Tianjin, china ;School of Control School of Computer Science and Technology Shandong University of Technology Zibo, Shandong province, School of Control Science and Engineering Shandong University Jinan, Shandong province, china

会议类型: 国际会议

会议名称: 2006 IEEE International Conference on Information Acquisition

会议地点: 山东威海

会议语种:英文

页码: 983-987

在线出版日期: 2006-08-20（万方平台首次上网日期，不代表论文的发表时间）

会议专题

Q-Learning Based Method of Adaptive Path Planning for Mobile Robot