Multi-Sensor Cooperative Tracking Using Distributed Nash Q-Learning
Traditional target tracking algorithm has a disadvantage of excessive dependence on the environment model.Thus a multi-sensor cooperative tracking method using distributed Nash Q-learning was proposed.Distributed Nash Q-leaming with model-free was firstly described.Then sensor action and reward function were defined,which both are very crucial to the learning.Sensor action was only subjected to angle control,and reward function was given by calculating the trace of one time-step prediction error covariance.Nash tragedy can not be directly calculated,therefore,a probability statistics method using Bayesian inference was used to update the Q function.Simulation of passive tracking merely with angle measurements shows that this algorithm can enhance the adaptation to environment change and the tracking accuracy.
Reinforcement learning Nash Q-learning Target tracking Extended Kalman filtering Multi-sensor cooperation Distribution
Jia Cai Changqiang Huang Haifeng Guo
Aeronautics and Astronautics Engineering Institute,Air Force Engineering University,Xian,China
国际会议
广州
英文
1475-1478
2012-11-16(万方平台首次上网日期,不代表论文的发表时间)