A Framework for Fast Community Extraction of Large-Scale Networks
Most of the faster community extraction algorithms are based on the Clauset, Newman and Moore (CNM), which is em-ployed for networks with sizes up to 500,000 nodes. The modication proposed by Danon, Diaz and Arenas (DDA) obtains better modularity among CNM and its variations, but there is no improvement in speed as its authors ex-pressed. In this paper, we identify some ineciencies in the data structure employed by former algorithms. We pro-pose a new framework for the algorithm and a modication of the DDA to make it applicable to large-scale networks. For instance, the community extraction of a network with 1 million nodes and 5 million edges was performed in about 14 minutes in contrast to former CNM that required 45 hours (192 times the former CNM, obtaining better modularity). The scalability of our improvements is shown by applying it to networks with sizes up to 10 million nodes, obtaining the best modularity and execution time compared to the former algorithms.
Community analysis Clustering Large-scale networks
Yutaka I. Leon-Suematsu Kikuo Yuta
NiCT/ATR 2-2-2 Hikaridai, Seika-cho, Souraku-gun Kyoto 619-0288, Japan
国际会议
第十七届国际万维网大会(the 17th International World Wide Web Conference)(WWW08)
北京
英文
2008-04-21(万方平台首次上网日期,不代表论文的发表时间)