Efficient Algorithm for Computing Link-based Similarity in Real World Networks

被引:10
|
作者
Cai, Yuanzhe
Cong, Gao
Jia, Xu
Liu, Hongyan
He, Jun
Lu, Jiaheng
Du, Xiaoyong
机构
关键词
Similarity Calculation; SimRank; Graph Mining;
D O I
10.1109/ICDM.2009.136
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Similarity calculation has many applications, such as information retrieval, and collaborative filtering, among many others. It has been shown that link-based similarity measure, such as SimRank, is very effective in characterizing the object similarities in networks, such as the Web, by exploiting the object-to-object relationship. Unfortunately, it is prohibitively expensive to compute the link-based similarity in a relatively large graph. In this paper, based on the observation that link-based similarity scores of real world graphs follow the power-law distribution, we propose a new approximate algorithm, namely Power-SimRank, with guaranteed error bound to efficiently compute link-based similarity measure. We also prove the convergence of the proposed algorithm. Extensive experiments conducted on real world datasets and synthetic datasets show that the proposed algorithm outperforms SimRank by four-five times in terms of efficiency while the error generated by the approximation is small.
引用
收藏
页码:734 / 739
页数:6
相关论文
共 50 条
  • [1] Efficient link-based similarity search in web networks
    Zhang, Mingxi
    Hu, Hao
    He, Zhenying
    Gao, Liping
    Sun, Liujie
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (22) : 8868 - 8880
  • [2] EFFICIENT COMPUTATIONS OF LINK-BASED SIMILARITY MEASURES ON THE GPU
    Jo, Yong-Yeon
    Bae, Duck-Ho
    Kim, Sang-Wook
    [J]. PROCEEDINGS OF THE 3RD IEEE INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC 2012), 2012, : 261 - 265
  • [3] On Link-based Similarity Join
    Sun, Liwen
    Cheng, Reynold
    Li, Xiang
    Cheung, David W.
    Han, Jiawei
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2011, 4 (11): : 714 - 725
  • [4] JacSim: An accurate and efficient link-based similarity measure in graphs
    Hamedani, Masoud Reyhani
    Kim, Sang-Wook
    [J]. INFORMATION SCIENCES, 2017, 414 : 203 - 224
  • [5] PageSim: A novel link-based similarity measure for the world wide web
    Lin, Zhenjiang
    King, Irwin
    Lyu, Michael R.
    [J]. 2006 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, (WI 2006 MAIN CONFERENCE PROCEEDINGS), 2006, : 687 - +
  • [6] A Link-Based Similarity for Improving Community Detection Based on Label Propagation Algorithm
    Kamal Berahmand
    Asgarali Bouyer
    [J]. Journal of Systems Science and Complexity, 2019, 32 : 737 - 758
  • [7] A Link-Based Similarity for Improving Community Detection Based on Label Propagation Algorithm
    Berahmand, Kamal
    Bouyer, Asgarali
    [J]. JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2019, 32 (03) : 737 - 758
  • [8] A Link-Based Similarity for Improving Community Detection Based on Label Propagation Algorithm
    BERAHMAND Kamal
    BOUYER Asgarali
    [J]. Journal of Systems Science & Complexity, 2019, 32 (03) : 737 - 758
  • [9] Link-based similarity measures for the classification of Web documents
    Calado, P
    Cristo, M
    Gonçalves, MA
    de Moura, ES
    Ribeiro-Neto, B
    Ziviani, N
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2006, 57 (02): : 208 - 221
  • [10] Link-Based Similarity Measures Using Reachability Vectors
    Yoon, Seok-Ho
    Kim, Ji-Soo
    Ha, Jiwoon
    Kim, Sang-Wook
    Ryu, Minsoo
    Choi, Ho-Jin
    [J]. SCIENTIFIC WORLD JOURNAL, 2014,