DHS: Adaptive Memory Layout Organization of Sketch Slots for Fast and Accurate Data Stream Processing

Cited by: 13
Authors
Zhao, Bohan [1]
Li, Xiang [1]
Tian, Boyu [1]
Mei, Zhiyu [1]
Wu, Wenfei [1]
Affiliations
[1] Tsinghua Univ, Beijing, Peoples R China
Keywords
Data stream processing; Approximate frequency estimation; Sketch; FREQUENT;
DOI
10.1145/3447548.3467353
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Data stream processing is a crucial computation task in data mining applications. The rigid and fixed data structures in existing solutions limit their accuracy, throughput, and generality in measurement tasks. We propose Dynamic Hierarchical Sketch (DHS), a sketch-based hybrid solution targeting these properties. During the online stream processing, DHS hashes items to buckets and organizes cells in each bucket dynamically; the size of all cells in a bucket is adjusted adaptively to the actual size and distribution of flows. Thus, memory is efficiently used to precisely record elephant flows and cover more mice flows. Implementation and evaluation show that DHS achieves high accuracy, high throughput, and high generality on five measurement tasks: flow size estimation, flow size distribution estimation, heavy hitter detection, heavy changer detection, and entropy estimation.
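The idea of hashing items to buckets and resizing cells inside each bucket can be illustrated with a toy model. The sketch below is not the DHS algorithm from the paper; it is a minimal illustration under assumptions that do not come from this record: the 8-bit fingerprint, the counter widths of 4, 8, and 16 bits, the per-bucket bit budget, the eviction policy, and all names (`ToyBucket`, `ToySketch`, `FP_BITS`, `WIDTHS`) are hypothetical.

```python
import hashlib

FP_BITS = 8          # fingerprint width in bits (assumed, not from the paper)
WIDTHS = (4, 8, 16)  # counter width per level in bits (assumed)


class ToyBucket:
    """A bucket with a fixed bit budget whose cells can grow on demand."""

    def __init__(self, budget_bits):
        self.budget_bits = budget_bits
        self.cells = {}  # fingerprint -> [level, count]

    def used_bits(self):
        return sum(FP_BITS + WIDTHS[lvl] for lvl, _ in self.cells.values())

    def _free_bits(self):
        return self.budget_bits - self.used_bits()

    def _make_room(self, keep, need):
        # Toy policy: evict the smallest other flows until `need` bits are free.
        while self._free_bits() < need:
            victims = [f for f in self.cells if f != keep]
            if not victims:
                return False
            del self.cells[min(victims, key=lambda f: self.cells[f][1])]
        return True

    def add(self, fp):
        cell = self.cells.get(fp)
        if cell is None:
            # New flows start in the smallest cell, if the bucket has room.
            if self._free_bits() >= FP_BITS + WIDTHS[0]:
                self.cells[fp] = [0, 1]
            return
        level, count = cell
        if count < (1 << WIDTHS[level]) - 1:
            cell[1] = count + 1
            return
        # The counter is full: widen the cell if a wider level exists.
        if level + 1 < len(WIDTHS):
            extra = WIDTHS[level + 1] - WIDTHS[level]
            if self._make_room(fp, extra):
                cell[0], cell[1] = level + 1, count + 1
        # Otherwise the counter saturates at its maximum value.


class ToySketch:
    def __init__(self, num_buckets=64, budget_bits=128):
        self.buckets = [ToyBucket(budget_bits) for _ in range(num_buckets)]

    def _locate(self, key):
        digest = hashlib.blake2b(key.encode(), digest_size=8).digest()
        h = int.from_bytes(digest, "big")
        return self.buckets[h % len(self.buckets)], (h >> 32) % (1 << FP_BITS)

    def add(self, key):
        bucket, fp = self._locate(key)
        bucket.add(fp)

    def estimate(self, key):
        bucket, fp = self._locate(key)
        cell = bucket.cells.get(fp)
        return cell[1] if cell else 0
```

In this toy model a flow that keeps growing is moved to progressively wider counters, while small flows stay in narrow cells, so one bucket can hold either a few large flows or many small ones. Fingerprint collisions inside a bucket cause overestimates, as in other fingerprint-based sketches.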
Pages: 2285-2293
Page count: 9