Augmented Sketch: Faster and More Accurate Stream Processing

被引:110
|
作者
Roy, Pratanu [1 ,3 ]
Khan, Arijit [2 ]
Alonso, Gustavo [1 ]
机构
[1] Swiss Fed Inst Technol, Syst Grp Comp Sci, Zurich, Switzerland
[2] NTU Singapore, Sch Comp Engn, Singapore, Singapore
[3] Oracle Labs, Zurich, Switzerland
关键词
data streams; sketch; approximated algorithms; data structures; stream summary;
D O I
10.1145/2882903.2882948
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Approximated algorithms are often used to estimate the frequency of items on high volume, fast data streams. The most common ones are variations of Count-Min sketch, which use sub-linear space for the count, but can produce errors in the counts of the most frequent items and can misclassify low-frequency items. In this paper, we improve the accuracy of sketch-based algorithms by increasing the frequency estimation accuracy of the most frequent items and reducing the possible misclassification of low-frequency items, while also improving the overall throughput. Our solution, called Augmented Sketch (ASketch), is based on a pre-filtering stage that dynamically identifies and aggregates the most frequent items. Items overflowing the pre-filtering stage are processed using a conventional sketch algorithm, thereby making the solution general and applicable in a wide range of contexts. The pre-filtering stage can be efficiently implemented with S I MD instructions on multi-core machines and can be further parallelized through pipeline parallelism where the filtering stage runs in one core and the sketch algorithm runs in another core.
引用
收藏
页码:1449 / 1463
页数:15
相关论文
共 50 条
  • [31] In the Real World, Faster Diagnoses Are Not Necessarily More Accurate
    Reilly, James B.
    Von Feldt, Joan M.
    [J]. ACADEMIC MEDICINE, 2013, 88 (03) : 297 - 298
  • [32] Solid phase extraction faster, more accurate and reliable
    [J]. Res Dev (Barrington IL), 2 (26):
  • [33] A Sketch Framework for Approximate Data Stream Processing in Sliding Windows
    Gou, Xiangyang
    Zhang, Yinda
    Hu, Zhoujing
    He, Long
    Wang, Ke
    Liu, Xilai
    Yang, Tong
    Wang, Yi
    Cui, Bin
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (05) : 4411 - 4424
  • [34] Fast and accurate stream processing by filtering the cold
    Tong Yang
    Jie Jiang
    Yang Zhou
    Long He
    Jinyang Li
    Bin Cui
    Steve Uhlig
    Xiaoming Li
    [J]. The VLDB Journal, 2019, 28 : 735 - 763
  • [35] Fast and accurate stream processing by filtering the cold
    Yang, Tong
    Jiang, Jie
    Zhou, Yang
    He, Long
    Li, Jinyang
    Cui, Bin
    Uhlig, Steve
    Li, Xiaoming
    [J]. VLDB JOURNAL, 2019, 28 (05): : 735 - 763
  • [36] The Future Production Measurement Technology: faster, safer, more accurate and more flexible
    Schmitt, Robert
    Imkamp, Dietrich
    [J]. ATP EDITION, 2011, (06): : 12 - 13
  • [37] Can Artificial Intelligence Make Screening Faster, More Accurate, and More Accessible?
    Li, Zhixi
    Keel, Stuart
    Liu, Chi
    He, Mingguang
    [J]. ASIA-PACIFIC JOURNAL OF OPHTHALMOLOGY, 2018, 7 (06): : 436 - 441
  • [38] In the Real World, Faster Diagnoses Are Not Necessarily More Accurate Reply
    Sherbino, Jonathan
    Norman, Geoffrey R.
    [J]. ACADEMIC MEDICINE, 2013, 88 (03) : 298 - 298
  • [39] Faster, more accurate diffusion filtering for fetal ultrasound volumes
    Kim, Min-Jeong
    Yun, Hyun-Joo
    Kim, Myoung-Hee
    [J]. IMAGE ANALYSIS AND RECOGNITION, PT 2, 2006, 4142 : 524 - 534
  • [40] FASTER AND MORE ACCURATE GRINDING OF FLEXIBLE PARTS IN CYLINDRICAL GRINDERS
    SOBOLER, SM
    OSTRENTO, AN
    [J]. SOVIET ENGINEERING RESEARCH, 1982, 2 (11): : 73 - 75