会议专题

Time Scale Risk-Sensitive Hierarchical Structure Control Problem

Hierarchical structure control problem solving is one of the keys to study a controlled Agent who learns its environment efficiently with large state space. It is more and more important to some practical applications in large state space control problems because of requiring an Agent to fit more complex environment, specially, in the area of studying for machine learning. We always regard the Markov Decision Processes (MDP) as the environment model of a reinforcement learning Agent.Bellmans optimal control equation is the base to solving this problem in simulating experiments and practical applications. We have introduced a kind of more complex and more practical environment for learning Agent in our previous work. Combining two concepts of risk-sensitive and multi-time scale, we have proposed a new conception which we refer it to multi-time scale risk-sensitive Markov Decision Processes. Under this new conception,we have modeled the basic Bellmans optimal control equation. Our motivation in this paper is to investigate this problem continually and gives a set of basic results.These results are all cores for framework of solving multi-time scale risk-sensitive control problems.

Hierarchical Structure Control Markov Decision Processes Multi-time Scale Risk Sensitive Bellman Equation

Changming Yin Huanwen Chen Lijuan Xie

College of Computer and Communicational Engineering, Changsha University of Science and Technology C College of Computer and Communicational Engineering, Changsha University of Science and Technology C

国际会议

2006 International Symposium on Distributed Computing and Applications to Business,Engineering and Science(2006年国际电子、工程及科学领域的分布式计算应用学术研讨会)

杭州

英文

990-993

2006-10-12(万方平台首次上网日期,不代表论文的发表时间)