A Teapot Graph and Its Hierarchical Structure of the Chinese Web
The shape of the Web in terms of its graphical structure has been a widely interested topic. Two graphs, Bow Tie and Daisy, have stood out from previous research. In this work, we take a different approach, by viewing the Web as a hierarchy of three levels, namely page level, host level, and domain level. Such structures are analyzed and compared with a snapshot of Chinese Web in early 2006, involving 830 million pages, 17 million hosts, and 0.8 million domains. Some interesting results have emerged. For example, the Chinese Web appears more like a teapot (with a large size of SCC, a medium size of IN and a small size of OUT) at page level than the classic bow tie or daisy shape. Some challenging phenomena are also observed. For example, the Ins become much smaller than OUTs at host and domain levels. Future work will tackle these puzzles.
Bow Tie Graph Daisy Graph Teapot Graph Self Similarity
Jonathan J. H. Zhu Tao Meng Zhengmao Xie Geng Li Xiaoming Li
Dept of Media & Communication City University of Hong Kong School of EECS, Peking University State Key Laboratory of Advanced Optical Communication Systems & Networks, Peking University
国际会议
第十七届国际万维网大会(the 17th International World Wide Web Conference)(WWW08)
北京
英文
2008-04-21(万方平台首次上网日期,不代表论文的发表时间)