Singular Value Decomposition for Dimensionality Reduction in Unsupervised Text Learning Problems
Partitioning vast amounts of text documents is a challenging problem due to a high dimensional representation of the documents. In this study, we investigate the quality of text document clustering when Singular Value Decomposition (SVD) is used to reduce the dimension of the documents. The results show that the quality of the clusters is very comparable to that of when the dimensions are not reduced. In addition, the computational cost to cluster documents can be reduced significantly when the clustering is done on a small dimension.
Singular Value Decomposition Unsupervised Text Learning
Taufik Fuadi Abidin Bustami Yusuf Munzir Umran
Data Mining and Information Retrieval Research Group Mathematics Department College of Mathematics and Natural Science Syiah Kuala University - Banda Aceh - Indonesia
国际会议
上海
英文
422-426
2010-06-22(万方平台首次上网日期,不代表论文的发表时间)