An Improved Genetic Algorithm for Text Clustering
The genetic algorithm (GA) is a self-adapted probability search method used to solve optimization problems which has been applied widely in science and engineering.In this paper,we propose an improved variable string length genetic algorithm (IVGA) for text clustering.Our algorithm has been exploited for automatically evolving the optimal number of clusters as well as providing proper data set clustering.The chromosome is encoded by special indices to indicate the location of each gene.More effective version of evolutional steps can automatically adjust the influence between the diversity of the population and selective pressure during generations.The superiority of the improved genetic algorithm over conventional variable string length genetic algorithm (VGA) is demonstrated by providing proper text clustering.
text clustering genertic algorithm gene index
Shidong YU Hang LI Qi XU
College of Software Shenyang Normal University Shenyang 110034,China Physical Science and Technical College Shenyang Normal University Shenyang 110034,China
国际会议
秦皇岛
英文
342-345
2010-11-05(万方平台首次上网日期,不代表论文的发表时间)