An Uncertainty-Based Belief Selection Method for POMDP Value Iteration

Abstract:

A partially observable Markov decision process (POMDP) provides a probabilistic model for decision making under uncertainty. Point-based value iteration algorithms are effective approximate methods for solving POMDP problems. Belief selection is a key step of point-based algorithms. In this paper, we propose a belief selection method based on the uncertainty of belief points. The algorithm first computes the uncertainties of the reachable belief points, and then selects those that have lower uncertainty and whose distance to the current belief set exceeds a threshold. The experimental results indicate that the method attains an approximate long-term discounted reward using fewer belief states than other point-based algorithms.
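The selection step described in the abstract can be illustrated with a minimal sketch. The abstract does not state which uncertainty measure or distance metric the paper uses, so this sketch assumes Shannon entropy as the uncertainty and the L1 norm as the distance; the function names and the `max_new` parameter are hypothetical and are not taken from the paper.

```python
import math


def entropy(belief):
    """Shannon entropy of a belief vector (ASSUMED uncertainty measure)."""
    return -sum(p * math.log(p) for p in belief if p > 0)


def l1_distance(b1, b2):
    """L1 distance between two beliefs (ASSUMED distance metric)."""
    return sum(abs(x - y) for x, y in zip(b1, b2))


def select_beliefs(belief_set, candidates, threshold, max_new=1):
    """Return up to max_new candidates, visited in order of increasing
    entropy, whose distance to every belief already kept exceeds threshold."""
    selected = []
    for cand in sorted(candidates, key=entropy):
        current = list(belief_set) + selected
        if all(l1_distance(cand, b) > threshold for b in current):
            selected.append(cand)
            if len(selected) == max_new:
                break
    return selected


# Hypothetical two-state example: one existing belief, three reachable ones.
belief_set = [(0.5, 0.5)]
candidates = [(0.55, 0.45), (0.9, 0.1), (0.6, 0.4)]
chosen = select_beliefs(belief_set, candidates, threshold=0.3, max_new=2)
```

In this toy run only the low-entropy candidate that lies far from the existing belief is kept; the other two fall within the threshold and are skipped, which is how a selection rule of this kind could keep the belief set small.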

Keywords: POMDP; value iteration; point-based algorithm; belief selection; uncertainty

Authors: Qi Feng, Xuezhong Zhou, Houkuan Huang, Xiaoping Zhang

Affiliation: School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China

Conference type: Domestic conference

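Conference name: Expert Symposium on Clinical Terminology Ontology Research for the "Digital Traditional Chinese Medicine Information System" (“数字化中医信息系统”临床术语本体研究专家研讨会)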

Conference location: Beijing

Conference language: English

Pages: 841-849

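Online publication date: 2014-09-01 (the date the record first went online on the Wanfang platform; it does not indicate the paper's publication date)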
