会议专题

Multi-Sensor Cooperative Tracking Using Distributed Nash Q-Learning

  Traditional target tracking algorithm has a disadvantage of excessive dependence on the environment model.Thus a multi-sensor cooperative tracking method using distributed Nash Q-learning was proposed.Distributed Nash Q-leaming with model-free was firstly described.Then sensor action and reward function were defined,which both are very crucial to the learning.Sensor action was only subjected to angle control,and reward function was given by calculating the trace of one time-step prediction error covariance.Nash tragedy can not be directly calculated,therefore,a probability statistics method using Bayesian inference was used to update the Q function.Simulation of passive tracking merely with angle measurements shows that this algorithm can enhance the adaptation to environment change and the tracking accuracy.

Reinforcement learning Nash Q-learning Target tracking Extended Kalman filtering Multi-sensor cooperation Distribution

Jia Cai Changqiang Huang Haifeng Guo

Aeronautics and Astronautics Engineering Institute,Air Force Engineering University,Xian,China

国际会议

the 2012 International Conference on Manufacturing Engineering and Automation (2012年制造工程与自动化国际会议(ICMEA2012))

广州

英文

1475-1478

2012-11-16(万方平台首次上网日期,不代表论文的发表时间)