Incremental Lossless Graph Summarization

Cited by: 27
Authors
Ko, Jihoon [1]
Kook, Yunbum [2]
Shin, Kijung [3]
Affiliations
[1] Korea Adv Inst Sci & Technol, AI, Daejeon, South Korea
[2] Korea Adv Inst Sci & Technol, Dept Math Sci, Daejeon, South Korea
[3] Korea Adv Inst Sci & Technol, AI & EE, Daejeon, South Korea
Funding
National Research Foundation, Singapore
DOI
10.1145/3394486.3403074
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Given a fully dynamic graph, represented as a stream of edge insertions and deletions, how can we obtain and incrementally update a lossless summary of its current snapshot? As large-scale graphs are prevalent, concisely representing them is inevitable for efficient storage and analysis. Lossless graph summarization is an effective graph-compression technique with many desirable properties. It aims to compactly represent the input graph as (a) a summary graph consisting of supernodes (i.e., sets of nodes) and superedges (i.e., edges between supernodes), which provide a rough description, and (b) edge corrections which fix errors induced by the rough description. While a number of batch algorithms, suited for static graphs, have been developed for rapid and compact graph summarization, they are highly inefficient in terms of time and space for dynamic graphs, which are common in practice. In this work, we propose MoSSo, the first incremental algorithm for lossless summarization of fully dynamic graphs. In response to each change in the input graph, MoSSo updates the output representation by repeatedly moving nodes among supernodes. MoSSo decides nodes to be moved and their destinations carefully but rapidly based on several novel ideas. Through extensive experiments on 10 real graphs, we show MoSSo is (a) Fast and "any time": processing each change in near-constant time (less than 0.1 millisecond), up to 7 orders of magnitude faster than running state-of-the-art batch methods, (b) Scalable: summarizing graphs with hundreds of millions of edges, requiring sub-linear memory during the process, and (c) Effective: achieving comparable compression ratios even to state-of-the-art batch methods.
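To make the output representation described in the abstract concrete, the following is a minimal Python sketch of how a lossless summary could be decoded back into the original undirected graph. It is not MoSSo and does not perform any incremental updates; the function name `decode`, the argument names `c_plus` and `c_minus` (edge corrections to add and to remove), and the toy graph are all assumptions made for illustration.

```python
from itertools import combinations


def decode(supernodes, superedges, c_plus, c_minus):
    """Rebuild an undirected edge set from a summary (hypothetical helper).

    supernodes: dict mapping supernode id -> set of original nodes
    superedges: iterable of (supernode id, supernode id) pairs
    c_plus:     edges missing from the rough description (to be added)
    c_minus:    edges wrongly implied by the rough description (to be removed)
    """
    edges = set()
    for a, b in superedges:
        if a == b:
            # A self-loop superedge stands for every pair inside the supernode.
            pairs = combinations(sorted(supernodes[a]), 2)
        else:
            # A superedge stands for every pair across the two supernodes.
            pairs = ((u, v) for u in supernodes[a] for v in supernodes[b])
        edges.update(frozenset(p) for p in pairs)
    # Apply the edge corrections on top of the rough description.
    edges |= {frozenset(e) for e in c_plus}
    edges -= {frozenset(e) for e in c_minus}
    return edges


# Toy example (assumed data): 4 nodes grouped into 2 supernodes.
supernodes = {"A": {1, 2}, "B": {3, 4}}
superedges = [("A", "B")]   # implies edges 1-3, 1-4, 2-3, 2-4
c_plus = [(1, 2)]           # edge 1-2 exists but is not implied
c_minus = [(2, 4)]          # edge 2-4 is implied but does not exist

original = decode(supernodes, superedges, c_plus, c_minus)
print(sorted(tuple(sorted(e)) for e in original))
```

In this toy case the summary stores one superedge and two corrections in place of four edges, which illustrates why grouping nodes with similar neighborhoods can shrink the representation while still allowing exact recovery.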
Pages: 317-327
Number of pages: 11