Persistent graph stream summarization for real-time graph analytics

被引:0
|
作者
Yan Jia
Zhaoquan Gu
Zhihao Jiang
Cuiyun Gao
Jianye Yang
机构
[1] Harbin Institute of Technology,School of Computer Science and Technology
[2] Hunan University,College of Computer Sience and Electronic Engineering
[3] Guangzhou University,Cyberspace Institute of Advanced Technology
来源
World Wide Web | 2023年 / 26卷
关键词
Graph sketch; Graph summarization; Query processing;
D O I
暂无
中图分类号
学科分类号
摘要
In massive and rapid graph streams, a useful and important task is to summarize the structure of graph streams in order to enable efficient and effective graph query processing. Although this task has been extensively studied in the literature, we observe that the existing solutions for graph sketches can only answer queries about the current status of the graph stream. In this paper, we aim at designing persistent graph sketches to support graph queries in any given time range in the past. To this end, we first introduce a baseline method by extending an existing graph summarization method. However, our empirical study suggests that the accuracy performance of the baseline method is unsatisfactory, especially when the query time interval is large. To tackle this issue, we propose a new method PGSS-BDH, which stores the streaming edges using a set of hierarchically organized hashmaps. When a query arrives, we divide the query time interval into a set of disjoint sub-intervals and return the sum of query results on all sub-intervals as the overall query answer. Observing that PGSS-BDH bears a linear space cost to the graph stream size, we further propose an advance method PGSS-MDC by using a set of fixed-size hierarchical counters to store the weight of edges, where the query processing is similar to PGSS-BDH. We theoretically analyze the accuracy performance of PGSS-BDH and PGSS-MDC. The experiment results on real-life datasets demonstrate that PGSS-MDC can return much more accurate answers than the competitors by consuming comparable query time and much less memory.
引用
收藏
页码:2647 / 2667
页数:20
相关论文
共 50 条
  • [1] Persistent graph stream summarization for real-time graph analytics
    Jia, Yan
    Gu, Zhaoquan
    Jiang, Zhihao
    Gao, Cuiyun
    Yang, Jianye
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2023, 26 (05): : 2647 - 2667
  • [2] SIM: A fast real-time graph stream summarization with improved memory efficiency and accuracy
    Li, Zhuo
    Liu, Shuaijun
    Liu, Jindian
    Zhang, Yu
    Liang, Teng
    Liu, Kaihua
    COMPUTER NETWORKS, 2024, 248
  • [3] Sim: A Fast Real-Time Graph Stream Summarization with Improved Memory Efficiency and Accuracy
    Li, Zhuo
    Liu, Shuaijun
    Liu, Jindian
    Zhang, Yu
    Liang, Teng
    Liu, Kaihua
    SSRN,
  • [4] Graph Stream Summarization
    Tang, Nan
    Chen, Qing
    Mitra, Prasenjit
    SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, : 1481 - 1496
  • [5] Real-Time Streaming Intelligence: Integrating Graph and NLP Analytics
    Ediger, David
    Appling, Scott
    Briscoe, Erica
    McColl, Rob
    Poovey, Jason
    2014 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2014,
  • [6] Fast and Accurate Graph Stream Summarization
    Gou, Xiangyang
    Zou, Lei
    Zhao, Chenxingyu
    Yang, Tong
    2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, : 1118 - 1129
  • [7] Hashing for Adaptive Real-Time Graph Stream Classification With Concept Drifts
    Chi, Lianhua
    Li, Bin
    Zhu, Xingquan
    Pan, Shirui
    Chen, Ling
    IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (05) : 1591 - 1604
  • [8] Graph Analytics for Real-Time Scoring of Cross-Channel Transactional Fraud
    Molloy, Ian
    Chari, Suresh
    Finkler, Ulrich
    Wiggerman, Mark
    Jonker, Coen
    Habeck, Ted
    Park, Youngja
    Jordens, Frank
    van Schaik, Ron
    FINANCIAL CRYPTOGRAPHY AND DATA SECURITY, FC 2016, 2017, 9603 : 22 - 40
  • [9] Interactive Graph Stream Analytics in Arkouda
    Du, Zhihui
    Rodriguez, Oliver Alvarado
    Patchett, Joseph
    Bader, David A.
    ALGORITHMS, 2021, 14 (08)
  • [10] Real-time Traffic Jam Detection and Congestion Reduction Using Streaming Graph Analytics
    Abbas, Zainab
    Sottovia, Paolo
    Hassan, Mohamad Al Hajj
    Foroni, Daniele
    Bortoli, Stefano
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 3109 - 3118