Scalable Community Search over Large-scale Graphs based on Graph Transformer

被引:0
|
作者
Wang, Yuxiang [1 ]
Gou, Xiaoxuan [1 ]
Xu, Xiaoliang [1 ]
Geng, Yuxia [1 ]
Ke, Xiangyu [2 ]
Wu, Tianxing [3 ]
Yu, Zhiyuan [1 ]
Chen, Runhuai [1 ]
Wu, Xiangying [1 ]
机构
[1] Hangzhou Dianzi Univ, Hangzhou, Zhejiang, Peoples R China
[2] Zhejiang Univ, Hangzhou, Zhejiang, Peoples R China
[3] Southeast Univ, Nanjing, Jiangsu, Peoples R China
关键词
Community Search; Graph Transformer; EFFICIENT; INFORMATION;
D O I
10.1145/3626772.3657771
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given a graph G and a query node q, community search (CS) aims to find a structurally cohesive subgraph from G that contains q CS is widely used in many real-world applications, such as online recommendation and expert finding. Recently, the rise of learning-based CS methods has garnered extensive research interests, showcasing the promising potential of neural solutions. However, there remains room for optimization: (1) They initialize node features via classical methods, e.g., one-hot, random, and position encoding, which may fall short in capturing valuable community cohesiveness-related features. (2) The reliance on GCN or GCN-like models poses challenges in scaling to large graphs. (3) Existing methods do not adapt well to dynamic graphs, often requiring retraining from scratch. To handle this, we present CSFormer, a scalable CS based on Graph Transformer. First, we present a novel l -hop neighborhood community vector based on n-order h-index to represent each node's community features, generating a sequence of feature vectors by varying the neighborhood scope l Then, we build a Transformer backbone to learn a good graph embedding that carries rich community features, based on which we perform a prediction-filteringbased online CS to efficiently return a community of q We extend CSFormer to dynamic graphs and various community models. Extensive experiments on seven real-world graphs showour solution's superiority on effectiveness, e.g., we attain an average improvement of 20.6% in F1-score compared to the latest competitors.
引用
收藏
页码:1680 / 1690
页数:11
相关论文
共 50 条
  • [31] Highly Scalable Large-Scale Asynchronous Graph Processing using Actors
    Elmougy, Youssef
    Hayashi, Akihiro
    Sarkar, Vivek
    [J]. Proceedings - 23rd IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing Workshops, CCGridW 2023, 2023, : 242 - 248
  • [32] Highly Scalable Large-Scale Asynchronous Graph Processing using Actors
    Elmougy, Youssef
    Hayashi, Akihiro
    Sarkar, Vivek
    [J]. 2023 IEEE/ACM 23RD INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING WORKSHOPS, CCGRIDW, 2023, : 242 - 248
  • [33] Self-Supervised Graph Transformer on Large-Scale Molecular Data
    Rong, Yu
    Bian, Yatao
    Xu, Tingyang
    Xie, Weiyang
    Wei, Ying
    Huang, Wenbing
    Huang, Junzhou
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [34] Large-Scale Visual Search with Binary Distributed Graph at Alibaba
    Zhao, Kang
    Pan, Pan
    Zheng, Yun
    Zhang, Yanhao
    Wang, Changxu
    Zhang, Yingya
    Xu, Yinghui
    Jin, Rong
    [J]. PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 2567 - 2575
  • [35] Heterogeneous Graph Propagation for Large-Scale Web Image Search
    Xie, Lingxi
    Tian, Qi
    Zhou, Wengang
    Zhang, Bo
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (11) : 4287 - 4298
  • [36] Effective and Efficient Community Search Over Large Directed Graphs
    Fang, Yixiang
    Wang, Zhongran
    Cheng, Reynold
    Wang, Hongzhi
    Hu, Jiafeng
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (11) : 2093 - 2107
  • [37] Effective and Efficient Community Search over Large Directed Graphs
    Fang, Yixiang
    Wang, Zhongran
    Cheng, Reynold
    Wang, Hongzhi
    Hu, Jiafeng
    [J]. 2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, : 2157 - 2158
  • [38] Scalable Graph Similarity Search in Large Graph Databases
    Kiran, P.
    Sivadasan, Naveen
    [J]. PROCEEDINGS OF THE 2015 IEEE RECENT ADVANCES IN INTELLIGENT COMPUTATIONAL SYSTEMS (RAICS), 2015, : 207 - 211
  • [39] Scalable Implementation of a MapReduce-based Graph Processing Algorithm for Large-scale Heterogeneous Supercomputers
    Shirahata, Koichi
    Sato, Hitoshi
    Suzumura, Toyotaro
    Matsuoka, Satoshi
    [J]. PROCEEDINGS OF THE 2013 13TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID 2013), 2013, : 277 - 284
  • [40] Large-scale quantum networks based on graphs
    Epping, Michael
    Kampermann, Hermann
    Bruss, Dagmar
    [J]. NEW JOURNAL OF PHYSICS, 2016, 18