Strategy Entropy as a Measure of Strategy Convergence in Reinforcement Learning

The concept of entropy is introduced into reinforcement learning.The definitions of the local and global strategy entropy are presented.The global strategy entropy is experimentally proved to be the quantitative problem-independent measure of the strategy’s convergence degree.The experimental results show that the learning based on the local strategy entropy improves the learning performance.
Xiaodong Zhuang Zhuo Chen
Electronics & Engineering Dept.,Automation Engineering College,Qingdao University,Qingdao,China College of Information Science & Technology,Qingdao University of Science and Technology,Qingdao,Chi
国际会议
武汉
英文
2008-11-01(万方平台首次上网日期,不代表论文的发表时间)