Time Scale Risk-Sensitive Hierarchical Structure Control Problem

摘要：

Hierarchical structure control problem solving is one of the keys to study a controlled Agent who learns its environment efficiently with large state space. It is more and more important to some practical applications in large state space control problems because of requiring an Agent to fit more complex environment, specially, in the area of studying for machine learning. We always regard the Markov Decision Processes (MDP) as the environment model of a reinforcement learning Agent.Bellmans optimal control equation is the base to solving this problem in simulating experiments and practical applications. We have introduced a kind of more complex and more practical environment for learning Agent in our previous work. Combining two concepts of risk-sensitive and multi-time scale, we have proposed a new conception which we refer it to multi-time scale risk-sensitive Markov Decision Processes. Under this new conception,we have modeled the basic Bellmans optimal control equation. Our motivation in this paper is to investigate this problem continually and gives a set of basic results.These results are all cores for framework of solving multi-time scale risk-sensitive control problems.

关键词： Hierarchical Structure Control Markov Decision Processes Multi-time Scale Risk Sensitive Bellman Equation

作者: Changming Yin Huanwen Chen Lijuan Xie

作者单位: College of Computer and Communicational Engineering, Changsha University of Science and Technology C College of Computer and Communicational Engineering, Changsha University of Science and Technology C

会议类型: 国际会议

会议名称: 2006 International Symposium on Distributed Computing and Applications to Business,Engineering and Science(2006年国际电子、工程及科学领域的分布式计算应用学术研讨会)

会议地点: 杭州

会议语种:英文

页码: 990-993

在线出版日期: 2006-10-12（万方平台首次上网日期，不代表论文的发表时间）

会议专题

Time Scale Risk-Sensitive Hierarchical Structure Control Problem