Efficient and Effective Similarity Search over Bipartite Graphs

被引:2
|
作者
Yang, Renchi [1 ]
机构
[1] Natl Univ Singapore, Singapore, Singapore
关键词
Bipartite Graphs; Similarity Search; Approximate Algorithms; PERSONALIZED PAGERANK QUERIES; RANDOM-WALK; COMPUTATION; ALGORITHMS;
D O I
10.1145/3485447.3511959
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Similarity search over a bipartite graph aims to retrieve from the graph the nodes that are similar to each other, which finds applications in various fields such as online advertising, recommender systems etc. Existing similarity measures either (i) overlook the unique properties of bipartite graphs, or (ii) fail to capture highorder information between nodes accurately, leading to suboptimal result quality. Recently, Hidden Personalized PageRank (HPP) is applied to this problem and found to be more effective compared with prior similarity measures. However, existing solutions for HPP computation incur significant computational costs, rendering it inefficient especially on large graphs. In this paper, we first identify an inherent drawback of HPP and overcome it by proposing bidirectional HPP (BHPP). Then, we formulate similarity search over bipartite graphs as the problem of approximate BHPP computation, and present an efficient solution Approx-BHPP. Specifically, Approx-BHPP offers rigorous theoretical accuracy guarantees with optimal computational complexity by combining deterministic graph traversal with matrix operations in an optimized and non-trivial way. Moreover, our solution achieves significant gain in practical efficiency due to several carefully-designed optimizations. Extensive experiments, comparing BHPP against 8 existing similarity measures over 7 real bipartite graphs, demonstrate the effectiveness of BHPP on query rewriting and item recommendation. Moreover, Approx-BHPP outperforms baseline solutions often by up to orders of magnitude in terms of computational time on both small and large datasets.
引用
收藏
页码:308 / 318
页数:11
相关论文
共 50 条
  • [21] Similarity Search Over Graphs Using Localized Spectral Analysis
    Aizenbud, Yariv
    Averbuch, Amir
    Shabat, Gil
    Ziv, Guy
    2017 INTERNATIONAL CONFERENCE ON SAMPLING THEORY AND APPLICATIONS (SAMPTA), 2017, : 635 - 638
  • [22] Efficient and effective similarity search over probabilistic data based on Earth Mover’s Distance
    Jia Xu
    Zhenjie Zhang
    Anthony K. H. Tung
    Ge Yu
    The VLDB Journal, 2012, 21 : 535 - 559
  • [23] Efficient and Effective Similarity Search over Probabilistic Data based on Earth Mover's Distance
    Xu, Jia
    Zhang, Zhenjie
    Tung, Anthony K. H.
    Yu, Ge
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2010, 3 (01): : 758 - 769
  • [24] Efficient and effective similarity search over probabilistic data based on Earth Mover's Distance
    Xu, Jia
    Zhang, Zhenjie
    Tung, Anthony K. H.
    Yu, Ge
    VLDB JOURNAL, 2012, 21 (04): : 535 - 559
  • [25] Efficient Maximum Edge-Weighted Biclique Search on Large Bipartite Graphs
    Wang, Jianhua
    Yang, Jianye
    Zhang, Chengyuan
    Lin, Xuemin
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (08) : 7921 - 7934
  • [26] Effective Community Search over Large Spatial Graphs
    Fang, Yixiang
    Cheng, Reynold
    Li, Xiaodong
    Luo, Siqiang
    Hu, Jiafeng
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 10 (06): : 709 - 720
  • [27] Efficient Closest Community Search over Large Graphs
    Cai, Mingshen
    Chang, Lijun
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2020), PT II, 2020, 12113 : 569 - 587
  • [28] Efficient Subgraph Search over Large Uncertain Graphs
    Yuan, Ye
    Wang, Guoren
    Wang, Haixun
    Chen, Lei
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2011, 4 (11): : 876 - 886
  • [29] Efficient Enumeration of Bipartite Subgraphs in Graphs
    Wasa, Kunihiro
    Uno, Takeaki
    COMPUTING AND COMBINATORICS (COCOON 2018), 2018, 10976 : 454 - 466
  • [30] Efficient (α, β)-core computation in bipartite graphs
    Liu, Boge
    Yuan, Long
    Lin, Xuemin
    Qin, Lu
    Zhang, Wenjie
    Zhou, Jingren
    VLDB JOURNAL, 2020, 29 (05): : 1075 - 1099