Improved Frequency Estimation Algorithms with and without Predictions

被引:0
|
作者
Aamand, Anders [1 ]
Chen, Justin Y. [1 ]
Huy Le Nguyen [2 ]
Silwal, Sandeep [1 ]
Vakilian, Ali [3 ]
机构
[1] MIT, Cambridge, MA 02139 USA
[2] Northeastern Univ, Boston, MA 02115 USA
[3] TTIC, Chicago, IL USA
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年
关键词
PROBABILITY-INEQUALITIES;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Estimating frequencies of elements appearing in a data stream is a key task in large-scale data analysis. Popular sketching approaches to this problem (e.g., CountMin and CountSketch) come with worst-case guarantees that probabilistically bound the error of the estimated frequencies for any possible input. The work of Hsu et al. (2019) introduced the idea of using machine learning to tailor sketching algorithms to the specific data distribution they are being run on. In particular, their learning-augmented frequency estimation algorithm uses a learned heavy-hitter oracle which predicts which elements will appear many times in the stream. We give a novel algorithm, which in some parameter regimes, already theoretically outperforms the learning based algorithm of Hsu et al. without the use of any predictions. Augmenting our algorithm with heavy-hitter predictions further reduces the error and improves upon the state of the art. Empirically, our algorithms achieve superior performance in all experiments compared to prior approaches.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Improved Method for Frequency Estimation of Sampled Sinusoidal Signals Without Iteration
    Park, Soon Young
    Song, Young Sub
    Kim, Hang Joon
    Park, Jongsik
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2011, 60 (08) : 2828 - 2834
  • [2] Motion Estimation Algorithms With and Without Interpolation
    Priyadarshini, K.
    Karthick, M.
    BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2020, 13 (02): : 67 - 70
  • [3] Fast algorithms for single frequency estimation
    Klein, JD
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (05) : 1762 - 1770
  • [4] Joint Frequency and Time Estimation Algorithms
    Tayem, Nizar
    Raza, Syed Ahmed
    Omer, Muhammad
    Abul Hussain, Ahmed
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2016, 41 (09) : 3511 - 3519
  • [5] Joint Frequency and Time Estimation Algorithms
    Nizar Tayem
    Syed Ahmed Raza
    Muhammad Omer
    Ahmed Abul Hussain
    Arabian Journal for Science and Engineering, 2016, 41 : 3511 - 3519
  • [6] Improved boosting algorithms using confidence-rated predictions
    Schapire, RE
    Singer, Y
    MACHINE LEARNING, 1999, 37 (03) : 297 - 336
  • [7] Improved Boosting Algorithms Using Confidence-rated Predictions
    Robert E. Schapire
    Yoram Singer
    Machine Learning, 1999, 37 : 297 - 336
  • [8] Estimation of distribution algorithms without explicit selections
    Munetomo, M
    8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL V, PROCEEDINGS: COMPUTER SCIENCE AND ENGINEERING, 2004, : 80 - 85
  • [9] Learning Predictions for Algorithms with Predictions
    Khodak, Mikhail
    Balcan, Maria-Florina
    Talwalkar, Ameet
    Vassilvitskii, Sergei
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [10] Improved gap size estimation for scaffolding algorithms
    Sahlin, Kristoffer
    Street, Nathaniel
    Lundeberg, Joakim
    Arvestad, Lars
    BIOINFORMATICS, 2012, 28 (17) : 2215 - 2222