A Hierarchical Reinforcement Learning Algorithm Based On Heuristic Reward Function
A hierarchical reinforcement learning method based on heuristic reward function is proposed to solve the problem of curse of dimcnsionality, that is the states space will grow exponentially in the number of features, and low convergence speed. The method can reduce state spaces greatly and can enhance the speed of the study. Choose actions with favorable purpose and efficiency so as to optimize reward function and enhance convergence speed. Apply this method to the Tetris game; the experiment result shows that the method can partly solve the curse of dimensionality and can enhance the convergence speed prominent.
hierarchical reinforcement learning heuristic reward function Tetris curse of dimensionality
Qicui Yan Quan Liu Daojing Hu
School of Computer Science and Technology Soochow University Suzhou, Jiangsu, 215006, China
国际会议
The 2nd IEEE International Conference on Advanced Computer Control(第二届先进计算机控制国际会议 ICACC 2010)
沈阳
英文
371-376
2010-03-27(万方平台首次上网日期,不代表论文的发表时间)