会议专题

A Hierarchical Reinforcement Learning Algorithm Based On Heuristic Reward Function

A hierarchical reinforcement learning method based on heuristic reward function is proposed to solve the problem of curse of dimcnsionality, that is the states space will grow exponentially in the number of features, and low convergence speed. The method can reduce state spaces greatly and can enhance the speed of the study. Choose actions with favorable purpose and efficiency so as to optimize reward function and enhance convergence speed. Apply this method to the Tetris game; the experiment result shows that the method can partly solve the curse of dimensionality and can enhance the convergence speed prominent.

hierarchical reinforcement learning heuristic reward function Tetris curse of dimensionality

Qicui Yan Quan Liu Daojing Hu

School of Computer Science and Technology Soochow University Suzhou, Jiangsu, 215006, China

国际会议

The 2nd IEEE International Conference on Advanced Computer Control(第二届先进计算机控制国际会议 ICACC 2010)

沈阳

英文

371-376

2010-03-27(万方平台首次上网日期,不代表论文的发表时间)