SLUGGER: Lossless Hierarchical Summarization of Massive Graphs

Cited by: 3
Authors
Lee, Kyuhan [1]
Ko, Jihoon [1]
Shin, Kijung [1]
Affiliations
[1] Korea Advanced Institute of Science and Technology (KAIST), Kim Jaechul Graduate School of AI, Seoul, South Korea
Funding
National Research Foundation, Singapore;
Keywords
COMPRESSION;
DOI
10.1109/ICDE53745.2022.00040
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Given a massive graph, how can we exploit its hierarchical structure to summarize the graph concisely but exactly? By exploiting this structure, can we achieve better compression rates than state-of-the-art graph summarization methods? The explosive proliferation of the Web has accelerated the emergence of large graphs, such as online social networks and hyperlink networks. Consequently, graph compression has become increasingly important for processing such large graphs without expensive I/O over the network or to disk. Among a number of approaches, graph summarization, which in essence combines similar nodes into a supernode and describes their connectivity concisely, stands out for its several advantages. However, we note that it fails to exploit the pervasive hierarchical structures of real-world graphs, because its underlying representation model forces supernodes to be disjoint. In this work, we propose the hierarchical graph summarization model, an expressive graph representation model that includes the previous one proposed by Navlakha et al. as a special case. The new model represents an unweighted graph using positive and negative edges between hierarchical supernodes, each of which can contain others. Then, we propose SLUGGER, a scalable heuristic for concisely and exactly representing a given graph under our new model. SLUGGER greedily merges nodes into supernodes while maintaining and exploiting their hierarchy, which is later pruned. SLUGGER significantly accelerates this process through sampling, approximation, and memoization. Our experiments on 16 real-world graphs show that SLUGGER is (a) Effective: yielding summaries up to 29.6% more concise than those of state-of-the-art lossless summarization methods, (b) Fast: summarizing a graph with 0.8 billion edges in a few hours, and (c) Scalable: scaling linearly with the number of edges in the input graph.
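To make the representation model in the abstract more concrete, the following is a minimal Python sketch of how a graph might be decoded from positive and negative edges between nested supernodes. It is not the authors' implementation. The names `leaves`, `decode`, `children`, `p_edges`, and `n_edges` are hypothetical, and the decoding rule used here (a node pair is an edge when more positive than negative superedges cover it) is an assumption made for illustration, since the abstract does not state the rule.

```python
from itertools import product


def leaves(node, children):
    """Return the set of original nodes contained in a (super)node."""
    kids = children.get(node)
    if not kids:
        # Anything without listed children is treated as an original node.
        return {node}
    out = set()
    for kid in kids:
        out |= leaves(kid, children)
    return out


def decode(children, p_edges, n_edges):
    """Rebuild an undirected edge set from signed superedges.

    Assumed rule (illustrative only): a pair {u, v} is an edge when the
    positive superedges covering it outnumber the negative ones.
    """
    score = {}
    for sign, superedges in ((1, p_edges), (-1, n_edges)):
        for a, b in superedges:
            # Collect each unordered pair once per superedge, skipping u == v.
            pairs = {
                frozenset((u, v))
                for u, v in product(leaves(a, children), leaves(b, children))
                if u != v
            }
            for pair in pairs:
                score[pair] = score.get(pair, 0) + sign
    return {pair for pair, total in score.items() if total > 0}


# Hypothetical toy input: supernode "R" contains "A" = {1, 2} and "B" = {3, 4}.
children = {"R": ["A", "B"], "A": [1, 2], "B": [3, 4]}
p_edges = [("R", "R")]  # connect every pair of nodes inside R
n_edges = [(1, 2)]      # then cancel the single pair {1, 2}
edges = decode(children, p_edges, n_edges)
```

In this toy case, two signed superedges stand in for five ordinary edges, which hints at why nesting supernodes can shorten the description. A full cost accounting would presumably also charge for storing the hierarchy itself, which this sketch ignores.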
Pages: 472 - 484
Number of pages: 13