Small Data Set Learning with Synthetic Samples and Area Membership Values

摘要：

A common strategy in manufacturing systems is to execute pilot runs before the mass production. We model the limited data obtained from pilot runs,and try to shorten the lead time required for predict future production in this study. A manufacturing system is usually comprehensive;Artificial Neural Networks are commonly applied to extract management knowledge from acquired data for its non-linear properties. It is the fundamental assumption for Artificial Neural Networks to get as large a number of training data as needed;nevertheless,this is often not achievable for pilot runs because there are few data obtained during trial stages,and theoretically this means that the obtained knowledge is fragile. The purpose of this research is through the proposed procedure to decrease the Artificial Neural Networks prediction error rate in small data set problems. Based on a consideration of dependent data attributes,the proposed procedure is designed to utilize extreme value theory and statistical prediction interval calculations to derive fuzzybased synthetic samples to fill sparse data information gaps. After synthetic samples are generated,area membership values are applied to combine data values,and occurrence possibility for every sample is used as the input of Artificial Neural Networks. The results of this research indicate that the prediction error rate can be significantly decreased by applying the proposed procedure to a very small data set with attribute dependency.

关键词： Small data set Area Membership Function Pilot runs Neural Networks Prediction Interval

作者: Yao-San Lin Tung-I Tsai Li-Te Yin

作者单位: Department of Information Management,Chung Hwa University of Medical Technology,Tainan,Taiwan Department of Information Management,Shu-Te University,Kaohsiung,Taiwan Department of Optometry,Chung Hwa University of Medical Technology,Tainan,Taiwan

会议类型: 国际会议

会议名称: 2011 International Conference on Opto-Electronics Engineering and Information Science(2011光电电子工程与信息科学国际会议 ICOEIS 2011)

会议地点: 西安

会议语种:英文

页码: 1625-1629

在线出版日期: 2011-12-23（万方平台首次上网日期，不代表论文的发表时间）

会议专题

Small Data Set Learning with Synthetic Samples and Area Membership Values