The nearest neighbor algorithm of filling missing data based on cluster analysis
Missing data universally exists in various research fields and it results in bad computational performance and effcet.In order to improve the accuracy of filling in the missing data,a filling missing data algorithm of the nearest neighbor based on the cluster analysis is proposed by this paper.After clustering data analysis,the algorithm assigns weights according to the categories and improves calculation formula and filling value calculation based on the MGNN (Mahalanobis-Gray and Nearest Neighbor algorithm) algorithm.The experimental results show that the filling accuracy of the method is higher than traditional KNN algorithm and MGNN algorithm.
Grey mcorrelation Mahalanobis distance Cluster analysis Nearest neighbor alorithm Maximum
Zhang Chi Jin Kai Fong Hong-cai Yang Ting
Department of mathematics and Computer,Wuhan Polytechnic University Electronic banking department,de Department of mathematics and Computer,Wuhan Polytechnic University Wuhan,China
国际会议
杭州
英文
2579-2582
2013-03-22(万方平台首次上网日期,不代表论文的发表时间)