Neural-Network-Based Optimal Control for Discrete-Time Nonlinear Systems Using General Value Iteration
In this paper, we propose a novel adaptive dynamic programming (ADP) scheme based on general value iteration to obtain near optimal control for discrete-time nonlinear systems with continuous state and control space. First, the selection of initial value function is different from the traditional value iteration, and a new method is introduced to demonstrate the convergence property and convergence speed of the value function. Then, the control law obtained at each iteration can stabilize the system under some conditions. At last, three neural networks with Levenberg-Marquardt training algorithm are used to approximate the unknown nonlinear system, the value function and the optimal control law. One simulation example is presented to demonstrate the effectiveness of the present scheme.
Adaptive dynamic programming approximate dynamic programming optimal control value iteration neural networks reinforcement learning
LI Hongliang LIU Derong WANG Ding
State Key Laboratory of Management and Control for Complex SystemsInstitute of Automation, Chinese A State Key Laboratory of Management and Control for Complex Systems Institute of Automation, Chinese
国际会议
The 31st Chinese Control Conference(第三十一届中国控制会议)
合肥
英文
2932-2937
2012-07-01(万方平台首次上网日期,不代表论文的发表时间)