Model-free Adaptive Dynamic Programming for Unknown Systems
In this paper, we present online model-free adaptive critic (AC) schemes based on approximate dynamic programming (ADP) to solve optimal control problems in both the discrete-time and continuous-time domains. In the discrete-time case, it is shown that the proposed ADP algorithm in fact solves the underlying Generalized Algebraic Riccati Equation (GARE) of the corresponding optimal control problem or zero-sum game. In the continuous-time domain, an ADP scheme is introduced to solve the underlying ARE of the optimal control problem. It is shown that this continuous-time ADP scheme is in fact a Quasi-Newton method for solving the ARE. In both time domains, the adaptive critic algorithms are easy to initialize, since the initial policies are not required to be stabilizing.
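For illustration, the following is a minimal Python/NumPy sketch of a model-free Q-learning value iteration for a standard discrete-time LQR problem, in the spirit of the discrete-time scheme summarized above (not the paper's exact algorithm): the quadratic Q-function kernel H is fitted from one-step simulated transitions by least squares, and the feedback gain is improved from H alone, so no knowledge of the system matrices is used and the zero initial gain need not be stabilizing. The example system, cost weights, iteration counts, and helper names (vech_basis, unvech) are illustrative assumptions.

```python
# Minimal sketch of model-free Q-learning value iteration for discrete-time LQR.
# All numerical values below are illustrative assumptions, not from the paper.
import numpy as np

np.random.seed(0)

# Simulator-side dynamics: hidden from the learner, which only observes (x, u, x_next).
A = np.array([[1.0, 0.1],
              [0.0, 1.1]])          # open-loop unstable example system (assumed)
B = np.array([[0.0],
              [0.1]])
Qc = np.eye(2)                      # state weighting
Rc = np.eye(1)                      # control weighting
n, m = B.shape
p = n + m

def vech_basis(z):
    """Quadratic basis such that z @ H @ z == vech_basis(z) @ H[triu] for symmetric H."""
    rows, cols = np.triu_indices(len(z))
    scale = np.where(rows == cols, 1.0, 2.0)    # off-diagonal entries appear twice
    return np.outer(z, z)[rows, cols] * scale

def unvech(v, size):
    """Rebuild a symmetric matrix from its upper-triangular entries."""
    H = np.zeros((size, size))
    H[np.triu_indices(size)] = v
    return H + H.T - np.diag(np.diag(H))

H = np.zeros((p, p))                # Q-function kernel, initialized at zero
K = np.zeros((m, n))                # initial gain: not required to be stabilizing
for j in range(150):                # value-iteration sweeps
    Phi, targets = [], []
    for _ in range(50):             # one-step transitions under exploratory inputs
        x = np.random.randn(n)
        u = np.random.randn(m)
        x_next = A @ x + B @ u
        u_next = -K @ x_next        # current policy evaluated at the next state
        z = np.concatenate([x, u])
        z_next = np.concatenate([x_next, u_next])
        Phi.append(vech_basis(z))
        # Target: stage cost plus the previous Q-function at (x_next, u_next)
        targets.append(x @ Qc @ x + u @ Rc @ u + z_next @ H @ z_next)
    vechH, *_ = np.linalg.lstsq(np.array(Phi), np.array(targets), rcond=None)
    H = unvech(vechH, p)
    K = np.linalg.solve(H[n:, n:], H[n:, :n])   # greedy update: u = -H_uu^{-1} H_ux x

print("learned feedback gain K =\n", K)
print("closed-loop eigenvalues:", np.linalg.eigvals(A - B @ K))
```

Because only measured transitions (x, u, x_next) enter the least-squares fit, the same loop runs when A and B are unknown; under standard stabilizability and detectability assumptions the learned kernel converges to the solution of the discrete-time Riccati equation.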
Approximate dynamic programming; Adaptive critics; Q-learning; Policy iterations; Optimal control; Zero-sum games.
Murad Abu-Khalaf, Frank L. Lewis, Asma Al-Tamimi, Draguna Vrabie
Automation and Robotics Research Institute, The University of Texas at Arlington, 7300 Jack Newell Blvd. S, Ft. Worth, Texas 76118-7115
International conference
Xiamen
English
105-114
2006-07-27 (date first posted on the Wanfang platform; not necessarily the paper's publication date)