会议专题

Comparative Studies of Model Performance Based on Different Data Sampling Methods

  This paper presents a comparative study on the effects of different data sampling methods to the performance of data-driven models.An engineering benchmark modeling problem is investigated,focused on which,three sampling methods,i.e.orthogonal Latin sampling,uniform design sampling and random sampling are used to generate the training data of different property.Six typical data-driven modeling techniques,which consist of artificial intelligent methods (least squares support vector machine,BP neural network and RBF neural network) and statistical methods (multiple linear regression,linear and nonlinear partial least squares regressions),are performed to make the comparison.The root mean square error (RMSE),R square (R2) and mean relative error (MRE) values are taken as the comparison criteria.The results reveal that data sampling and data property play a very key role in establishing an accurate data-driven model.

orthogonal Latin sampling, uniform design data-driven model least squares support vector machine artificial neural network partial least squares

You Lv Jizhen Liu Tingting Yang

The State Key Laboratory of Alternate Electrical Power System with Renewable Energy Sources,North China Electric Power University, Changping, Beijing 102206, China

国际会议

the 25th Chinese Control and Decision Conference(第25届中国控制与决策会议)

贵阳

英文

2731-2735

2013-05-01(万方平台首次上网日期,不代表论文的发表时间)