Auxo: A Scalable and Efficient Graph Stream Summarization Structure

被引:3
|
作者
Jiang, Zhiguo [1 ]
Chen, Hanhua [1 ]
Jin, Hai [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Natl Engn Res Ctr Big Data Technol & Syst Service, Serv Comp Technol & Syst Lab,Cluster & Grid Comp, Wuhan 430074, Peoples R China
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2023年 / 16卷 / 06期
关键词
SKETCHES;
D O I
10.14778/3583140.3583154
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A graph stream refers to a continuous stream of edges, forming a huge and fast-evolving graph. The vast volume and high update speed of a graph stream bring stringent requirements for the data management structure, including sublinear space cost, computation-efficient operation support, and scalability of the structure. Existing designs summarize a graph stream by leveraging a hash-based compressed matrix and representing an edge using its fingerprint to achieve practical storage for a graph stream with a known upper bound of data volume. However, they fail to support the dynamically extending of graph streams. In this paper, we propose Auxo, a scalable structure to support space/time efficient summarization of dynamic graph streams. Auxo is built on a proposed novel prefix embedded tree (PET) which leverages binary logarithmic search and common binary prefixes embedding to provide an efficient and scalable tree structure. PET reduces the item insert/query time from O(vertical bar E vertical bar) to O(log vertical bar E vertical bar) as well as reducing the total storage cost by a log vertical bar E vertical bar scale, where vertical bar E vertical bar is the size of the edge set in a graph stream. To further improve the memory utilization of PET during scaling, we propose a proportional PET structure that extends a higher level in a proportionally incremental style. We conduct comprehensive experiments on large-scale real-world datasets to evaluate the performance of this design. Results show that Auxo significantly reduces the insert and query time by one to two orders of magnitude compared to the state of the arts. Meanwhile, Auxo achieves efficiently and economically structure scaling with an average memory utilization of over 80%.
引用
收藏
页码:1386 / 1398
页数:13
相关论文
共 50 条
  • [1] Horae: A Graph Stream Summarization Structure for Efficient Temporal Range Query
    Chen, Ming
    Zhou, Renxiang
    Chen, Hanhua
    Xiao, Jiang
    Jin, Hai
    Li, Bo
    [J]. 2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 2792 - 2804
  • [2] Graph Stream Summarization
    Tang, Nan
    Chen, Qing
    Mitra, Prasenjit
    [J]. SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, : 1481 - 1496
  • [3] Scalable Dynamic Graph Summarization
    Tsalouchidou, Ioanna
    Morales, Giamnarco De Francisci
    Bonchi, Francesco
    Baeza-Yates, Ricardo
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 1032 - 1039
  • [4] Scalable Dynamic Graph Summarization
    Tsalouchidou, Ioanna
    Bonchi, Francesco
    Morales, Gianmarco De Francisci
    Baeza-Yates, Ricardo
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (02) : 360 - 373
  • [5] Auxo: Efficient Federated Learning via Scalable Client Clustering
    Liu, Jiachen
    Lai, Fan
    Dai, Yinwei
    Akella, Aditya
    Madhyastha, Harsha V.
    Chowdhury, Mosharaf
    [J]. PROCEEDINGS OF THE 2023 ACM SYMPOSIUM ON CLOUD COMPUTING, SOCC 2023, 2023, : 125 - 141
  • [6] Fast and Accurate Graph Stream Summarization
    Gou, Xiangyang
    Zou, Lei
    Zhao, Chenxingyu
    Yang, Tong
    [J]. 2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, : 1118 - 1129
  • [7] A new method for graph stream summarization based on both the structure and concepts
    Ashrafi-Payaman, Nosratali
    Kangavari, Mohammad Reza
    Fander, Amir Mohammad
    [J]. OPEN ENGINEERING, 2019, 9 (01): : 500 - 511
  • [8] Cuckoo Matrix: A High Efficient and Accurate Graph Stream Summarization on Limited Memory
    Li, Zhuo
    Li, Zhuoran
    Fan, Zhiyuan
    Zhao, Jianli
    Zeng, Siming
    Luo, Peng
    Liu, Kaihua
    [J]. ELECTRONICS, 2023, 12 (02)
  • [9] GS4: Graph stream summarization based on both the structure and semantics
    Ashrafi-Payaman, Nosratali
    Kangavari, Mohammad Reza
    Hosseini, Saeid
    Fander, Amir Mohammad
    [J]. JOURNAL OF SUPERCOMPUTING, 2021, 77 (03): : 2713 - 2733
  • [10] GS4: Graph stream summarization based on both the structure and semantics
    Nosratali Ashrafi-Payaman
    Mohammad Reza Kangavari
    Saeid Hosseini
    Amir Mohammad Fander
    [J]. The Journal of Supercomputing, 2021, 77 : 2713 - 2733