A distributed streaming framework for edge-cloud triangle counting in graph streams

被引:0
|
作者
Yang, Xu [1 ]
Song, Chao [2 ]
Gu, Jiqing [3 ]
Li, Ke [1 ]
Li, Hongwei [2 ]
机构
[1] Xian Univ Technol, Sch Engn & Comp Sci, Xian 710049, Shaanxi, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 610731, Sichuan, Peoples R China
[3] Chengdu Univ Informat Technol, Sch Comp Sci, Chengdu 610225, Sichuan, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Triangle counting; Approximate algorithms; Streaming graphs; Distributed streaming algorithms; ALGORITHMS;
D O I
10.1016/j.knosys.2023.110878
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The triangle counting problem in graph streams has been extensively studied in social network analysis, recommendation systems, user portraits and other fields. However, cloud computing based streaming algorithms cause high bandwidth occupation and long transmission latency due to limited bandwidth of the cloud. Recently, edge computing is promising to overcome the issue of transmitting large-scale data for cloud computing. However, directly applying edge computing in streaming triangle counting will reduce the accuracy of the triangle count estimation, due to the limitation of local computing at the edge network. We term the cooperations between edge computing and cloud computing for streaming triangle counting as edge-cloud triangle counting in graph streams. In this paper, we first propose a streaming framework for edge-cloud triangle counting in graph streams. Then, we propose a streaming triangle counting algorithm called Trie-based Edge Compression (TbEC) by using the binary trie at the edge network that enables lossless compression and efficient transmission to the cloud. In addition, to extend our algorithms for triangle counting in multigraphs, we present a dual deduplication strategy collaboratively using the trie-based data structure and a Bloom Filter. Our experiments with real-world datasets show that TbEC is (a) Accurate: yielding up to 3.35x more accurate smaller estimation error than the state-of-the-art distributed streaming algorithm, (b) Fast: yielding up to 10.59x faster than the state-of-the-art distributed streaming algorithm, (c) Scalable: scaling linearly with the number of edges in the input graph stream.& COPY; 2023 Elsevier B.V. All rights reserved.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] CoCoS: Fast and Accurate Distributed Triangle Counting in Graph Streams
    Shin, Kijung
    Lee, Euiwoong
    Oh, Jinoh
    Hammoud, Mohammad
    Faloutsos, Christos
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2021, 15 (03)
  • [2] Triangle Counting in Dynamic Graph Streams
    Laurent Bulteau
    Vincent Froese
    Konstantin Kutzkov
    Rasmus Pagh
    [J]. Algorithmica, 2016, 76 : 259 - 278
  • [3] Triangle Counting in Dynamic Graph Streams
    Kutzkov, Konstantin
    Pagh, Rasmus
    [J]. ALGORITHM THEORY - SWAT 2014, 2014, 8503 : 306 - 318
  • [4] Triangle Counting in Dynamic Graph Streams
    Bulteau, Laurent
    Froese, Vincent
    Kutzkov, Konstantin
    Pagh, Rasmus
    [J]. ALGORITHMICA, 2016, 76 (01) : 259 - 278
  • [5] An Edge-Cloud Collaboration Framework for Graph Processing in Smart Society
    Zhou, Jun
    Kondo, Masaaki
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2023, 11 (04) : 985 - 1001
  • [6] Efficient RDF Streaming for the Edge-Cloud Continuum
    Sowinski, Piotr
    Wasielewska-Michniewska, Katarzyna
    Ganzha, Maria
    Pawlowski, Wieslaw
    Szmeja, Pawel
    Paprzycki, Marcin
    [J]. 2022 IEEE 8TH WORLD FORUM ON INTERNET OF THINGS, WF-IOT, 2022,
  • [7] Streaming Analytics in Edge-Cloud Environment for Logistics Processes
    von Stietencron, Moritz
    Lewandowski, Marco
    Lepenioti, Katerina
    Bousdekis, Alexandros
    Hribernik, Karl
    Apostolou, Dimitris
    Mentzas, Gregoris
    [J]. ADVANCES IN PRODUCTION MANAGEMENT SYSTEMS: TOWARDS SMART AND DIGITAL MANUFACTURING, PT II, 2020, 592 : 245 - 253
  • [8] How the Degeneracy Helps for Triangle Counting in Graph Streams
    Bera, Suman K.
    Seshadhri, C.
    [J]. PODS'20: PROCEEDINGS OF THE 39TH ACM SIGMOD-SIGACT-SIGAI SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2020, : 457 - 467
  • [9] DRL-Based Distributed Task Offloading Framework in Edge-Cloud Environment
    Nashaat, Heba
    Hashem, Walaa
    Rizk, Rawya
    Attia, Radwa
    [J]. IEEE ACCESS, 2024, 12 : 33580 - 33594
  • [10] Distributed Dataflow Across the Edge-Cloud Continuum
    Ekaireb, Tyler
    Brand, Lukas
    Avaraddy, Nagarjun
    Mock, Markus
    Krintz, Chandra
    Wolski, Rich
    [J]. 2024 IEEE 17TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, CLOUD 2024, 2024, : 316 - 327