Efficient and Effective Similarity Search over Bipartite Graphs

Cited by: 2
Authors
Yang, Renchi [1]
Affiliation
[1] Natl Univ Singapore, Singapore, Singapore
Keywords
Bipartite Graphs; Similarity Search; Approximate Algorithms; PERSONALIZED PAGERANK QUERIES; RANDOM-WALK; COMPUTATION; ALGORITHMS;
DOI
10.1145/3485447.3511959
Chinese Library Classification
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Similarity search over a bipartite graph aims to retrieve from the graph the nodes that are similar to each other, which finds applications in various fields such as online advertising and recommender systems. Existing similarity measures either (i) overlook the unique properties of bipartite graphs, or (ii) fail to capture high-order information between nodes accurately, leading to suboptimal result quality. Recently, Hidden Personalized PageRank (HPP) has been applied to this problem and found to be more effective than prior similarity measures. However, existing solutions for HPP computation incur significant computational costs, making them inefficient, especially on large graphs. In this paper, we first identify an inherent drawback of HPP and overcome it by proposing bidirectional HPP (BHPP). Then, we formulate similarity search over bipartite graphs as the problem of approximate BHPP computation, and present an efficient solution, Approx-BHPP. Specifically, Approx-BHPP offers rigorous theoretical accuracy guarantees with optimal computational complexity by combining deterministic graph traversal with matrix operations in an optimized and non-trivial way. Moreover, our solution achieves significant gains in practical efficiency due to several carefully designed optimizations. Extensive experiments, comparing BHPP against 8 existing similarity measures over 7 real bipartite graphs, demonstrate the effectiveness of BHPP on query rewriting and item recommendation. In addition, Approx-BHPP often outperforms baseline solutions by orders of magnitude in computational time on both small and large datasets.
Pages: 308-318
Number of pages: 11
Related papers
50 in total
  • [1] Efficient Similarity Search for Sets over Graphs
    Wang, Yue
    Feng, Zonghao
    Chen, Lei
    Li, Zijian
    Jian, Xun
    Luo, Qiong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (02) : 444 - 458
  • [2] Effective and efficient community search with size constraint on bipartite graphs
    Zhou, Keqi
    Xin, Junchang
    Chen, Jinyi
    Zhang, Xian
    Wang, Beibei
    Wang, Zhiqiong
    INFORMATION SCIENCES, 2023, 647
  • [3] Efficient and Effective Community Search on Large-scale Bipartite Graphs
    Wang, Kai
    Zhang, Wenjie
    Lin, Xuemin
    Zhang, Ying
    Qin, Lu
    Zhang, Yuting
    2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 85 - 96
  • [4] Maximum Biplex Search over Bipartite Graphs
    Luo, Wensheng
    Li, Kenli
    Zhou, Xu
    Gao, Yunjun
    Li, Keqin
    2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 898 - 910
  • [5] An Efficient Similarity Search Framework for SimRank over Large Dynamic Graphs
    Shao, Yingxia
    Cui, Bin
    Chen, Lei
    Liu, Mingming
    Xie, Xing
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2015, 8 (08): : 838 - 849
  • [6] Effective and Efficient Community Search Over Large Directed Graphs
    Fang, Yixiang
    Wang, Zhongran
    Cheng, Reynold
    Wang, Hongzhi
    Hu, Jiafeng
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (11) : 2093 - 2107
  • [7] Effective and Efficient Community Search over Large Directed Graphs
    Fang, Yixiang
    Wang, Zhongran
    Cheng, Reynold
    Wang, Hongzhi
    Hu, Jiafeng
    2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, : 2157 - 2158
  • [8] Effective Community Search on Large Attributed Bipartite Graphs
    Xu, Zongyu
    Zhang, Yihao
    Yuan, Long
    Qian, Yuwen
    Chen, Zi
    Zhou, Mingliang
    Mao, Qin
    Pan, Weibin
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (02)
  • [9] Efficient Top-k s-Biplexes Search over Large Bipartite Graphs
    Xu, Zhenxiang
    Liu, Yiping
    Zhou, Yi
    Hao, Yimin
    Wang, Zhengren
    arXiv,
  • [10] Communication Efficient Decentralized Learning Over Bipartite Graphs
    Ben Issaid, Chaouki
    Elgabli, Anis
    Park, Jihong
    Bennis, Mehdi
    Debbah, Merouane
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (06) : 4150 - 4167