Scalable Community Search over Large-scale Graphs based on Graph Transformer

被引:0
|
作者
Wang, Yuxiang [1 ]
Gou, Xiaoxuan [1 ]
Xu, Xiaoliang [1 ]
Geng, Yuxia [1 ]
Ke, Xiangyu [2 ]
Wu, Tianxing [3 ]
Yu, Zhiyuan [1 ]
Chen, Runhuai [1 ]
Wu, Xiangying [1 ]
机构
[1] Hangzhou Dianzi Univ, Hangzhou, Zhejiang, Peoples R China
[2] Zhejiang Univ, Hangzhou, Zhejiang, Peoples R China
[3] Southeast Univ, Nanjing, Jiangsu, Peoples R China
关键词
Community Search; Graph Transformer; EFFICIENT; INFORMATION;
D O I
10.1145/3626772.3657771
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given a graph G and a query node q, community search (CS) aims to find a structurally cohesive subgraph from G that contains q CS is widely used in many real-world applications, such as online recommendation and expert finding. Recently, the rise of learning-based CS methods has garnered extensive research interests, showcasing the promising potential of neural solutions. However, there remains room for optimization: (1) They initialize node features via classical methods, e.g., one-hot, random, and position encoding, which may fall short in capturing valuable community cohesiveness-related features. (2) The reliance on GCN or GCN-like models poses challenges in scaling to large graphs. (3) Existing methods do not adapt well to dynamic graphs, often requiring retraining from scratch. To handle this, we present CSFormer, a scalable CS based on Graph Transformer. First, we present a novel l -hop neighborhood community vector based on n-order h-index to represent each node's community features, generating a sequence of feature vectors by varying the neighborhood scope l Then, we build a Transformer backbone to learn a good graph embedding that carries rich community features, based on which we perform a prediction-filteringbased online CS to efficiently return a community of q We extend CSFormer to dynamic graphs and various community models. Extensive experiments on seven real-world graphs showour solution's superiority on effectiveness, e.g., we attain an average improvement of 20.6% in F1-score compared to the latest competitors.
引用
收藏
页码:1680 / 1690
页数:11
相关论文
共 50 条
  • [21] Approximate Shortest Distance Queries with Advanced Graph Analytics over Large-scale Encrypted Graphs
    Luo, Yuchuan
    Wang, Dongsheng
    Fu, Shaojing
    Xu, Ming
    Chen, Yingwen
    Huang, Kai
    [J]. 2022 18TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING, MSN, 2022, : 287 - 294
  • [22] Coupling graph perturbation theory with scalable parallel algorithms for large-scale enumeration of maximal cliques in biological graphs
    Samatova, N. F.
    Schmidt, M. C.
    Hendrix, W.
    Breimyer, P.
    Thomas, K.
    Park, B. -H.
    [J]. SCIDAC 2008: SCIENTIFIC DISCOVERY THROUGH ADVANCED COMPUTING, 2008, 125
  • [23] LARGE-SCALE COMMUNITY DETECTION ON SPEAKER CONTENT GRAPHS
    Shum, Stephen H.
    Campbell, William M.
    Reynolds, Douglas A.
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7716 - 7720
  • [24] Graph partitioning MapReduce-based algorithms for counting triangles in large-scale graphs
    Sharafeldeen, Ahmed
    Alrahmawy, Mohammed
    Elmougy, Samir
    [J]. SCIENTIFIC REPORTS, 2023, 13 (01)
  • [25] Graph partitioning MapReduce-based algorithms for counting triangles in large-scale graphs
    Ahmed Sharafeldeen
    Mohammed Alrahmawy
    Samir Elmougy
    [J]. Scientific Reports, 13
  • [26] MDPCluster: a swarm-based community detection algorithm in large-scale graphs
    Shirjini, Mahsa Fozuni
    Farzi, Saeed
    Nikanjam, Amin
    [J]. COMPUTING, 2020, 102 (04) : 893 - 922
  • [27] MDPCluster: a swarm-based community detection algorithm in large-scale graphs
    Mahsa Fozuni Shirjini
    Saeed Farzi
    Amin Nikanjam
    [J]. Computing, 2020, 102 : 893 - 922
  • [28] An Efficient and Scalable Algorithmic Method for Generating Large-Scale Random Graphs
    Alam, Maksudul
    Khan, Maleq
    Vullikanti, Anil
    Marathe, Madhav
    [J]. SC '16: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2016, : 372 - 383
  • [29] Top-k Community Similarity Search Over Large-Scale Road Networks
    Rai, Niranjan
    Lian, Xiang
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (10) : 10710 - 10721
  • [30] A community detection algorithm based on graph compression for large-scale social networks
    Zhao, Xingwang
    Liang, Jiye
    Wang, Jie
    [J]. INFORMATION SCIENCES, 2021, 551 : 358 - 372