Scalable Community Search over Large-scale Graphs based on Graph Transformer

被引:0
|
作者
Wang, Yuxiang [1 ]
Gou, Xiaoxuan [1 ]
Xu, Xiaoliang [1 ]
Geng, Yuxia [1 ]
Ke, Xiangyu [2 ]
Wu, Tianxing [3 ]
Yu, Zhiyuan [1 ]
Chen, Runhuai [1 ]
Wu, Xiangying [1 ]
机构
[1] Hangzhou Dianzi Univ, Hangzhou, Zhejiang, Peoples R China
[2] Zhejiang Univ, Hangzhou, Zhejiang, Peoples R China
[3] Southeast Univ, Nanjing, Jiangsu, Peoples R China
关键词
Community Search; Graph Transformer; EFFICIENT; INFORMATION;
D O I
10.1145/3626772.3657771
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given a graph G and a query node q, community search (CS) aims to find a structurally cohesive subgraph from G that contains q CS is widely used in many real-world applications, such as online recommendation and expert finding. Recently, the rise of learning-based CS methods has garnered extensive research interests, showcasing the promising potential of neural solutions. However, there remains room for optimization: (1) They initialize node features via classical methods, e.g., one-hot, random, and position encoding, which may fall short in capturing valuable community cohesiveness-related features. (2) The reliance on GCN or GCN-like models poses challenges in scaling to large graphs. (3) Existing methods do not adapt well to dynamic graphs, often requiring retraining from scratch. To handle this, we present CSFormer, a scalable CS based on Graph Transformer. First, we present a novel l -hop neighborhood community vector based on n-order h-index to represent each node's community features, generating a sequence of feature vectors by varying the neighborhood scope l Then, we build a Transformer backbone to learn a good graph embedding that carries rich community features, based on which we perform a prediction-filteringbased online CS to efficiently return a community of q We extend CSFormer to dynamic graphs and various community models. Extensive experiments on seven real-world graphs showour solution's superiority on effectiveness, e.g., we attain an average improvement of 20.6% in F1-score compared to the latest competitors.
引用
收藏
页码:1680 / 1690
页数:11
相关论文
共 50 条
  • [1] Scalable and Efficient Flow-Based Community Detection for Large-Scale Graph Analysis
    Bae, Seung-Hee
    Halperin, Daniel
    West, Jevin D.
    Rosvall, Martin
    Howe, Bill
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2017, 11 (03)
  • [2] Efficient and Effective Community Search on Large-scale Bipartite Graphs
    Wang, Kai
    Zhang, Wenjie
    Lin, Xuemin
    Zhang, Ying
    Qin, Lu
    Zhang, Yuting
    [J]. 2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 85 - 96
  • [3] A Scalable Distributed Louvain Algorithm for Large-scale Graph Community Detection
    Zeng, Jianping
    Yu, Hongfeng
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2018, : 268 - 278
  • [4] A large-scale graph search algorithms Based LRU
    Xiao Li
    Xiao Jing-Zhong
    [J]. 2011 INTERNATIONAL CONFERENCE ON FUTURE COMPUTER SCIENCE AND APPLICATION (FCSA 2011), VOL 3, 2011, : 72 - 75
  • [5] Pagrol: PArallel GRaph OLap over Large-scale Attributed Graphs
    Wang, Zhengkui
    Fan, Qi
    Wang, Huiju
    Tan, Kian-Lee
    Agrawal, Divyakant
    El Abbadi, Amr
    [J]. 2014 IEEE 30TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2014, : 496 - 507
  • [6] Masked Graph Transformer for Large-Scale Recommendation
    Chen, Huiyuan
    Xu, Zhe
    Yeh, Chin-Chia Michael
    Lai, Vivian
    Zheng, Yan
    Xu, Minghua
    Tong, Hanghang
    [J]. PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2502 - 2506
  • [7] Scalable Motif Counting for Large-scale Temporal Graphs
    Gao, Zhongqiang
    Cheng, Chuanqi
    Yu, Yanwei
    Cao, Lei
    Huang, Chao
    Dong, Junyu
    [J]. 2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 2656 - 2668
  • [8] Large-Scale Graph Neural Architecture Search
    Guan, Chaoyu
    Wang, Xin
    Chen, Hong
    Zhang, Ziwei
    Zhu, Wenwu
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [9] A graph-based cache for large-scale similarity search engines
    Veronica Gil-Costa
    Mauricio Marin
    Carolina Bonacic
    Roberto Solar
    [J]. The Journal of Supercomputing, 2018, 74 : 2006 - 2034
  • [10] Community search over large semantic-based attribute graphs
    Peiying Lin
    Siyang Yu
    Xu Zhou
    Peng Peng
    Kenli Li
    Xiangke Liao
    [J]. World Wide Web, 2022, 25 : 927 - 948