Probabilistic lossy counting: An efficient algorithm for finding heavy hitters

被引:0
|
作者
Dimitropoulos, Xenofontas [1 ]
Hurley, Paul [1 ]
Kind, Andreas [1 ]
机构
[1] IBM Zurich Res Lab, Zurich, Switzerland
关键词
algorithms; measurement; performance; heavy hitters; data streams;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Knowledge of the largest traffic flows in a network is important for many network management applications. The problem of finding these flows is known as the heavy-hitter problem and has been the subject of many studies in the past years. One of the most efficient and well-known algorithms for finding heavy hitters is lossy counting [29]. In this work we introduce probabilistic lossy counting (PLC) which enhances lossy counting in computing network traffic heavy hitters. PLC uses on a tighter error bound on the estimated sizes of traffic flows and provides probabilistic rather than deterministic guarantees on its accuracy. The probabilistic-based error bound substantially improves the memory consumption of the algorithm. In addition, PLC reduces the rate of false positives of lossy counting and achieves a low estimation error, although slightly higher than that of lossy counting. We compare PLC with state-of-the-art algorithms for finding heavy hitters. Our experiments using real traffic traces find that PLC has 1) between 34.4% and 74% lower memory consumption, 2) between 37.9% and 40.5% fewer false positives than lossy counting, and 3) a small estimation error.
引用
收藏
页码:7 / 16
页数:10
相关论文
共 50 条
  • [41] An efficient algorithm for finding ideal schedules
    Edward G. Coffman
    Dariusz Dereniowski
    Wiesław Kubiak
    [J]. Acta Informatica, 2012, 49 : 1 - 14
  • [42] An efficient algorithm for finding ideal schedules
    Coffman, Edward G., Jr.
    Dereniowski, Dariusz
    Kubiak, Wieslaw
    [J]. ACTA INFORMATICA, 2012, 49 (01) : 1 - 14
  • [43] AN EFFICIENT ALGORITHM FOR RHYTHM-FINDING
    CHIN, F
    WU, S
    [J]. COMPUTER MUSIC JOURNAL, 1992, 16 (02) : 35 - 44
  • [44] Finding by Counting: A Probabilistic Packet Count Model for Indoor Localization in BLE Environments
    De, Subham
    Chowdhary, Shreyans
    Shirke, Aniket
    Lo, Yat Long
    Kravets, Robin
    Sundaram, Hari
    [J]. PROCEEDINGS OF THE 11TH WORKSHOP ON WIRELESS NETWORK TESTBEDS, EXPERIMENTAL EVALUATION & CHARACTERIZATION (WINTECH '17), 2017, : 67 - 74
  • [45] A LINEAR-TIME PROBABILISTIC COUNTING ALGORITHM FOR DATABASE APPLICATIONS
    WHANG, KY
    VANDERZANDEN, BT
    TAYLOR, HM
    [J]. ACM TRANSACTIONS ON DATABASE SYSTEMS, 1990, 15 (02): : 208 - 229
  • [47] An Efficient Algorithm for Counting Markov Equivalent DAGs
    Ganian, Robert
    Hamm, Thekla
    Talvitie, Topi
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 10136 - 10143
  • [48] An efficient algorithm for counting Markov equivalent DAGs
    Ganian, Robert
    Hamm, Thekla
    Talvitie, Topi
    [J]. ARTIFICIAL INTELLIGENCE, 2022, 304
  • [49] PhyME: A probabilistic algorithm for finding motifs in sets of orthologous sequences
    Sinha, S
    Blanchette, M
    Tompa, M
    [J]. BMC BIOINFORMATICS, 2004, 5 (1)
  • [50] PhyME: A probabilistic algorithm for finding motifs in sets of orthologous sequences
    Saurabh Sinha
    Mathieu Blanchette
    Martin Tompa
    [J]. BMC Bioinformatics, 5