Study on Handling Range Inputs Methods on C4.5 algorithm
How to build a decision tree successfully remains an active topic in data mining, and many scholars have contributed to improving decision tree building algorithms. However, a dataset may contain range input attributes, and the existing methods for handling them, namely mean substitute, minmax substitute and mean-extent substitute, may not be suitable. This paper combines C4.5 with fuzzy mathematics to put forward a framework for handling range inputs. The new method improves the calculation of both the membership grade and the entropy. A validation of the usefulness of the method is then presented. The method is expected to be applicable to investigation methodology, mainly for continuous inputs with inexact data consisting of maximums and minimums.
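The abstract does not give the formulas, so the following is only a minimal sketch of the general idea, not the authors' formulation. It assumes each range input is an interval (lo, hi), that the membership grade of an interval in the branch "value <= t" is the fraction of the interval lying at or below the threshold t, and that entropy is computed from membership-weighted class counts. All function names are hypothetical. The mean substitute baseline is included for contrast.

```python
import math


def mean_substitute(lo, hi):
    """Baseline: replace the range [lo, hi] with its midpoint."""
    return (lo + hi) / 2.0


def membership_below(lo, hi, t):
    """Assumed membership grade of [lo, hi] in the branch 'value <= t'.

    Taken here as the fraction of the interval at or below t
    (an illustrative assumption, not taken from the paper).
    """
    if hi == lo:
        return 1.0 if lo <= t else 0.0
    return min(1.0, max(0.0, (t - lo) / (hi - lo)))


def entropy(weights):
    """Shannon entropy (base 2) of non-negative class weights."""
    total = sum(weights)
    if total == 0:
        return 0.0
    return -sum(w / total * math.log2(w / total) for w in weights if w > 0)


def fuzzy_split_entropy(samples, t):
    """Weighted entropy after splitting at threshold t.

    samples: list of ((lo, hi), label) pairs. Each sample contributes
    its membership grade to the left branch and the remainder to the
    right branch, instead of falling entirely on one side.
    """
    left, right = {}, {}
    for (lo, hi), label in samples:
        m = membership_below(lo, hi, t)
        left[label] = left.get(label, 0.0) + m
        right[label] = right.get(label, 0.0) + (1.0 - m)
    n = len(samples)
    return sum(
        sum(side.values()) / n * entropy(list(side.values()))
        for side in (left, right)
    )
```

With a crisp substitute, an interval that straddles the threshold is forced onto one side. In this sketch it is shared between both branches in proportion to its overlap, which is one way fuzzy membership can enter the C4.5 entropy calculation.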
C4.5; Decision trees; Fuzzy mathematics; Investigation methodology
Han Jing-ti, Gu Yu-jia
School of Information Management and Engineering, Shanghai University of Finance and Economics (SHUFE), Shanghai 200433, China
International conference
Chongqing
English
47-49
2009-12-25 (date first posted online on the Wanfang platform; not the paper's publication date)