A Cloud-Based Parallel Space-Saving Algorithm for Big Networking Data

被引:0
|
作者
He, Dazhong [1 ]
Yang, Yang
Liu, Jun
机构
[1] Beijing Univ Posts & Telecommun, Ctr Data Sci, Beijing 100876, Peoples R China
来源
IEEE ACCESS | 2018年 / 6卷
基金
中国国家自然科学基金;
关键词
Network traffic; Top-k; streaming algorithm; parallel algorithm; STREAMS; FREQUENT; ELEMENTS;
D O I
10.1109/ACCESS.2018.2865745
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As the network continues to evolve, completely analyzing the traffic requires immeasurable resources. In situations of processing enormous streaming data, the most significant k items (Top-k) are more interesting, and some streaming algorithms are deployed due to relatively limited memory and also limited processing time per item. Space-saving is such one of the most popular algorithms for computation of frequent and Top-k elements in data streams. In this paper, this algorithm is implemented in the cloud for analyzing big networking data, and an empirical formula of the counter number is derived for efficiently maintaining Top-k items. Meanwhile, easily understandable proof manner is presented to prove the merging ability of Space-saving algorithm, and some experiments are conducted to affirm the effectiveness of the algorithm.
引用
收藏
页码:45886 / 45898
页数:13
相关论文
共 50 条
  • [1] Cloud-based computation and networking for space
    Carver, Brett
    Esposito, Timothy
    Lyke, James
    [J]. OPEN ARCHITECTURE/OPEN BUSINESS MODEL NET-CENTRIC SYSTEMS AND DEFENSE TRANSFORMATION 2019, 2019, 11015
  • [2] CUDA Based Parallel Implementations of Space-Saving on a GPU
    Cafaro, Massimo
    Epicoco, Italo
    Aloisio, Giovanni
    Pulimeno, Marco
    [J]. 2017 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS), 2017, : 707 - 714
  • [3] A Space-Saving Approximation Algorithm for Grammar-Based Compression
    Sakamoto, Hiroshi
    Maruyama, Shirou
    Kida, Takuya
    Shimozono, Shinichi
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2009, E92D (02): : 158 - 165
  • [4] A SPACE-SAVING MODIFICATION OF DAVIDSON EIGENVECTOR ALGORITHM
    VANLENTHE, JH
    PULAY, P
    [J]. JOURNAL OF COMPUTATIONAL CHEMISTRY, 1990, 11 (10) : 1164 - 1168
  • [5] Novel dynamic load balancing algorithm for cloud-based big data analytics
    Arman Aghdashi
    Seyedeh Leili Mirtaheri
    [J]. The Journal of Supercomputing, 2022, 78 : 4131 - 4156
  • [6] Performance prediction of parallel computing models to analyze cloud-based big data applications
    Shen, Chao
    Tong, Weiqin
    Choo, Kim-Kwang Raymond
    Kausar, Samina
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2018, 21 (02): : 1439 - 1454
  • [7] Performance prediction of parallel computing models to analyze cloud-based big data applications
    Chao Shen
    Weiqin Tong
    Kim-Kwang Raymond Choo
    Samina Kausar
    [J]. Cluster Computing, 2018, 21 : 1439 - 1454
  • [8] CADRE: A Cloud-Based Data Service for Big Bibliographic Data
    Yan, Xiaoran
    Ruan, Guangchen
    Nikolov, Dimitar
    Hutchinson, Matthew
    Kankanamalage, Chathuri Peli
    Serrette, Ben
    McCombs, James
    Walsh, Alan
    Tuna, Esen
    Pentchev, Valentin
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 4283 - 4292
  • [9] FaStore: a space-saving solution for raw sequencing data
    Roguski, Lukasz
    Ochoa, Idoia
    Hernaez, Mikel
    Deorowicz, Sebastian
    [J]. BIOINFORMATICS, 2018, 34 (16) : 2748 - 2756
  • [10] Cloud-based difference algorithm using big GPR data for roadbed damage detection
    Xu, Xianlei
    Gao, Wenru
    Zhang, Di
    Li, Taotao
    Qiao, Xu
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (23):