会议专题

Empirical Studies on Feature Selection for Software Fault Prediction

  Classification based software fault prediction methods aim to classify the modules into either fault-prone or non-faultprone.Feature selection is a preprocess step used to improve the data quality.However most of previous research mainly focus on feature relevance analysis,there is little work focusing on feature redundancy analysis.Therefore we propose a two-stage framework for feature selection to solve this issue.In particular,during the feature relevance phase,we adopt three different relevance measures to obtain the relevant feature subset.Then during the feature redundancy analysis phase,we use a cluster-based method to eliminate redundant features.To verify the effectiveness of our proposed framework,we choose typical real-world software projects,including Eclipse projects and NASA software project KC1.Final empirical result shows the effectiveness of our proposed framework.

Software Fault Prediction Feature Selection Relevance Analysis Redundancy Analysis

Jiaqiang Chen Shulong Liu Xiang Chen Qing Gu Daoxu Chen

State Key Laboratory for Novel Software Technology Nanjing University,Nanjing,China State Key Laboratory for Novel Software Technology Nanjing University,Nanjing,China;School of Comput

国际会议

第五届亚太网构软件研讨会

长沙

英文

163-166

2013-10-23(万方平台首次上网日期,不代表论文的发表时间)