Improved Frequency Estimation Algorithms with and without Predictions

Times cited: 0
Authors
Aamand, Anders [1]
Chen, Justin Y. [1]
Nguyen, Huy Le [2]
Silwal, Sandeep [1]
Vakilian, Ali [3]
Affiliations
[1] MIT, Cambridge, MA 02139 USA
[2] Northeastern Univ, Boston, MA 02115 USA
[3] TTIC, Chicago, IL USA
Source
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023
Keywords
PROBABILITY-INEQUALITIES;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Estimating frequencies of elements appearing in a data stream is a key task in large-scale data analysis. Popular sketching approaches to this problem (e.g., CountMin and CountSketch) come with worst-case guarantees that probabilistically bound the error of the estimated frequencies for any possible input. The work of Hsu et al. (2019) introduced the idea of using machine learning to tailor sketching algorithms to the specific data distribution they are being run on. In particular, their learning-augmented frequency estimation algorithm uses a learned heavy-hitter oracle that predicts which elements will appear many times in the stream. We give a novel algorithm which, in some parameter regimes, already theoretically outperforms the learning-based algorithm of Hsu et al. without the use of any predictions. Augmenting our algorithm with heavy-hitter predictions further reduces the error and improves upon the state of the art. Empirically, our algorithms achieve superior performance in all experiments compared to prior approaches.
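For orientation, the baseline setting the abstract refers to can be sketched roughly as follows. This is a minimal illustration of a plain CountMin sketch and of the general heavy-hitter-oracle idea attributed to Hsu et al. (predicted heavy elements get exact counters, everything else goes into the sketch); it is not the new algorithm proposed in this paper. The class names, the salted use of Python's built-in `hash`, and the representation of the oracle as a fixed set `predicted_heavy` are all assumptions made for the example.

```python
import random


class CountMin:
    """Plain CountMin sketch: `depth` rows of `width` counters each."""

    def __init__(self, width, depth, seed=0):
        rng = random.Random(seed)
        self.width = width
        # Assumption: one random salt per row stands in for an independent hash function.
        self.salts = [rng.getrandbits(64) for _ in range(depth)]
        self.rows = [[0] * width for _ in range(depth)]

    def _bucket(self, salt, item):
        return hash((salt, item)) % self.width

    def update(self, item, count=1):
        for salt, row in zip(self.salts, self.rows):
            row[self._bucket(salt, item)] += count

    def estimate(self, item):
        # Collisions only add mass, so the minimum over rows never underestimates.
        return min(
            row[self._bucket(salt, item)]
            for salt, row in zip(self.salts, self.rows)
        )


class LearnedCountMin:
    """Hypothetical learning-augmented variant: elements the oracle predicts
    to be heavy get their own exact counter; the rest share a CountMin."""

    def __init__(self, predicted_heavy, width, depth, seed=0):
        # Assumption: the oracle is modeled as a fixed set of predicted heavy items.
        self.exact = {item: 0 for item in predicted_heavy}
        self.sketch = CountMin(width, depth, seed)

    def update(self, item, count=1):
        if item in self.exact:
            self.exact[item] += count
        else:
            self.sketch.update(item, count)

    def estimate(self, item):
        if item in self.exact:
            return self.exact[item]
        return self.sketch.estimate(item)
```

Routing the predicted heavy elements around the sketch keeps their large counts from colliding with, and inflating the estimates of, the many infrequent elements.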
Pages: 13
Related papers
50 in total
  • [41] AN IMPROVED MDCT DOMAIN FREQUENCY ESTIMATION METHOD
    Dun, Yujie
    Liu, Guizhong
    2014 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (CHINASIP), 2014, : 120 - 123
  • [42] Viewpoint Algorithms with Predictions
    Mitzenmacher, Michael
    Vassilvitskii, Sergei
    COMMUNICATIONS OF THE ACM, 2022, 65 (07) : 33 - 35
  • [43] Water-Retention Curves of Coarse Soils Without Organic Matter: Improved Data for Improved Predictions
    Chapuis, Robert P.
    Masse, Isabelle
    Madinier, Benedicte
    Duhaime, Francois
    GEOTECHNICAL TESTING JOURNAL, 2015, 38 (03): : 325 - 337
  • [44] An Improved Rife Algorithm of Frequency Estimation for Frequency-Hopping Signal
    Lv, Jun
    Yun, Leying
    Li, Tong
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2016, PT II, 2016, 9772 : 181 - 191
  • [45] Hybridization Framework for Improved Dynamic Phasor Parameter Estimation Algorithms
    Qian, Cheng
    Kezunovic, Mladen
    2019 IEEE POWER & ENERGY SOCIETY INNOVATIVE SMART GRID TECHNOLOGIES CONFERENCE (ISGT), 2019,
  • [46] Improved genetic algorithms for varying parameter estimation in nonlinear system
    Gao, Tiehong
    Li, Chongxiao
    Han, Yanfang
    Tao, Mei
    Shu Ju Cai Ji Yu Chu Li/Journal of Data Acquisition and Processing, 2002, 17 (03):
  • [47] An Improved Estimation of Distribution Algorithms based on the Minimal Free Energy
    Yu, Fahong
    Chen, Meijia
    Liao, Weizhi
    MECHATRONICS, ROBOTICS AND AUTOMATION, PTS 1-3, 2013, 373-375 : 1093 - +
  • [48] Air quality predictions of the urban airshed model containing improved advection and chemistry algorithms
    Winkler, SL
    Chock, DP
    ENVIRONMENTAL SCIENCE & TECHNOLOGY, 1996, 30 (04) : 1163 - 1175
  • [49] Estimation of the Ionosphere Critical Frequency Without Radio Sounding
    Potapov, Alexander S.
    Polyushkina, Tatyana N.
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (07): : 5058 - 5065
  • [50] Frequency estimation of undamped exponential signals using genetic algorithms
    Mitra, Amit
    Kundu, Debasis
    Agrawal, Gunjan
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 51 (03) : 1965 - 1985