Multi-agent Cooperation by Q-learning in Continuous Action Domain

摘要：

In this paper we propose Q-learning with continuous action space and extend this algorithm to a multi-agent system.Conventional Q-learning needs a pre-defined and discrete state space.But it is not practical because the states of the environment in the real world and actions are both continuous.The algorithm will use a concept that is similar to the SRV(Stochastic Real-Valued Unit)to train the actions in each state.The convergence of the SRV may fall into local solution even if it has never reached the optimal solution.In order to overcome this drawback,the Q-learning with SRRV(Stochastic Recording Real-Valued unit)is proposed,and it shows that the SRRV will converge more quickly.

关键词： Q-learning Stochastic Real-Valued Unit

作者: Kao-Shing Hwang Member of IEEE Yu-Hong Lin Chia-Yue Lo

作者单位: Electrical Engineering,National Chung Cheng University 168,University Rd.,Min Hsiung Chia-Yi,Taiwan,ROC

会议类型: 国际会议

会议名称: 第一届智能网络与智能系统国际会议(ICINIS 2008)(The First International Conference on Intelligent Networks and Intelligent Systems)

会议地点: 武汉

会议语种:英文

在线出版日期: 2008-11-01（万方平台首次上网日期，不代表论文的发表时间）

会议专题

Multi-agent Cooperation by Q-learning in Continuous Action Domain