DHS: Adaptive Memory Layout Organization of Sketch Slots for Fast and Accurate Data Stream Processing

被引:13
|
作者
Zhao, Bohan [1 ]
Li, Xiang [1 ]
Tian, Boyu [1 ]
Mei, Zhiyu [1 ]
Wu, Wenfei [1 ]
机构
[1] Tsinghua Univ, Beijing, Peoples R China
关键词
Data stream processing; Approximate frequency estimation; Sketch; FREQUENT;
D O I
10.1145/3447548.3467353
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data stream processing is a crucial computation task in data mining applications. The rigid and fixed data structures in existing solutions limit their accuracy, throughput, and generality in measurement tasks. We propose Dynamic Hierarchical Sketch (DHS), a sketch-based hybrid solution targeting these properties. During the online stream processing, DHS hashes items to buckets and organizes cells in each bucket dynamically; the size of all cells in a bucket is adjusted adaptively to the actual size and distribution of flows. Thus, memory is efficiently used to precisely record elephant flows and cover more mice flows. Implementation and evaluation show that DHS achieves high accuracy, high throughput, and high generality on five measurement tasks: flow size estimation, flow size distribution estimation, heavy hitter detection, heavy changer detection, and entropy estimation.
引用
收藏
页码:2285 / 2293
页数:9
相关论文
共 50 条
  • [1] Augmented Sketch: Faster and More Accurate Stream Processing
    Roy, Pratanu
    Khan, Arijit
    Alonso, Gustavo
    [J]. SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, : 1449 - 1463
  • [2] Rhombus sketch: adaptive and more accurate sketch for streaming data
    Wei X.-H.
    Miao Y.-W.
    Wang X.-W.
    [J]. Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2022, 52 (04): : 874 - 884
  • [3] SF-sketch: A Fast, Accurate, and Memory Efficient Data Structure to Store Frequencies of Data Items
    Yang, Tong
    Liu, Lingtong
    Yan, Yibo
    Shahzad, Muhammad
    Shen, Yulong
    Li, Xiaoming
    Cui, Bin
    Xie, Gaogang
    [J]. 2017 IEEE 33RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2017), 2017, : 103 - 106
  • [4] Fast and accurate stream processing by filtering the cold
    Tong Yang
    Jie Jiang
    Yang Zhou
    Long He
    Jinyang Li
    Bin Cui
    Steve Uhlig
    Xiaoming Li
    [J]. The VLDB Journal, 2019, 28 : 735 - 763
  • [5] Fast and accurate stream processing by filtering the cold
    Yang, Tong
    Jiang, Jie
    Zhou, Yang
    He, Long
    Li, Jinyang
    Cui, Bin
    Uhlig, Steve
    Li, Xiaoming
    [J]. VLDB JOURNAL, 2019, 28 (05): : 735 - 763
  • [6] NMMF-Stream: A Fast and Accurate Stream-Processing Scheme for Network Monitoring Data Recovery
    Xie, Kun
    Xie, Ruotian
    Wang, Xin
    Xie, Gaogang
    Zhang, Dafang
    Wen, Jigang
    [J]. IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2022), 2022, : 2218 - 2227
  • [7] A Sketch Framework for Approximate Data Stream Processing in Sliding Windows
    Gou, Xiangyang
    Zhang, Yinda
    Hu, Zhoujing
    He, Long
    Wang, Ke
    Liu, Xilai
    Yang, Tong
    Wang, Yi
    Cui, Bin
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (05) : 4411 - 4424
  • [8] Memory Renaming: Fast, Early and Accurate Processing of Memory Communication
    Gary S. Tyson
    Todd M. Austin
    [J]. International Journal of Parallel Programming, 1999, 27 : 357 - 380
  • [9] Memory renaming: Fast, early and accurate processing of memory communication
    Tyson, GS
    Austin, TM
    [J]. INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 1999, 27 (05) : 357 - 380
  • [10] ADAPTIVE DISORDER CONTROL IN DATA STREAM PROCESSING
    Kim, Hyeon Gyu
    Kim, Cheolgi
    Kim, Myoung Ho
    [J]. COMPUTING AND INFORMATICS, 2012, 31 (02) : 393 - 410