会议专题

Hierarchical Reinforcement Learning with OMQ

A novel method of hierarchical reinforcement learning, named OMQ, by integrating Options into MAXQ is presented. In OMQ, the MAXQ is used as basic framework to design hierarchies experientially and learn online, and the Option is used to construct hierarchies automatically. The performance of OMQ is demonstrated in taxi domain and compared with Option and MAXQ. The simulation results show that the OMQ is more practical than Option and MAXQ in partial known environment.

hierarchical reinforcement learning Option MAXQ.

Jing Shen Haibo Liu Guochang Gu

School of Computer Science and Technology, Harbin Engineering University Harbin 150001, China

国际会议

Firth IEEE International Conference on Cognitive Informatics(第五届认知信息国际会议)

北京

英文

584-588

2006-07-17(万方平台首次上网日期,不代表论文的发表时间)