Recursive Quantile Estimation: Non-Asymptotic Confidence Bounds

被引:0
|
作者
Chen, Likai [1 ]
Keilbar, Georg [2 ]
Wu, Wei Biao [3 ]
机构
[1] Washington Univ, Dept Math & Stat, St Louis, MO 63130 USA
[2] Univ Vienna, Dept Stat & Operat Res, Vienna, Austria
[3] Univ Chicago, Dept Stat, Chicago, IL USA
关键词
Finite sample bounds; quantiles; stochastic gradient descent; Polyak-Ruppert averaging; recursive estimation; STOCHASTIC-APPROXIMATION; HILBERT-SPACES; REGRESSION; INFERENCE; RISK;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper considers the recursive estimation of quantiles using the stochastic gradient descent (SGD) algorithm with Polyak-Ruppert averaging. The algorithm offers a computationally and memory efficient alternative to the usual empirical estimator. Our focus is on studying the non-asymptotic behavior by providing exponentially decreasing tail probability bounds under mild assumptions on the smoothness of the density functions. This novel non-asymptotic result is based on a bound of the moment generating function of the SGD estimate. We apply our result to the problem of best arm identification in a multi-armed stochastic bandit setting under quantile preferences.
引用
收藏
页数:25
相关论文
共 50 条
  • [41] Non-asymptotic Delay Bounds for Networks with Heavy-Tailed Traffic
    Liebeherr, Joerg
    Burchard, Almut
    Ciucu, Florin
    2010 PROCEEDINGS IEEE INFOCOM, 2010,
  • [42] Non-asymptotic convergence bounds for Wasserstein approximation using point clouds
    Merigot, Quentin
    Santambrogio, Filippo
    Sarrazin, Clement
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [43] Non-asymptotic performance bounds for downlink MU-MIMO scheduling
    Zois, Georgios
    Michaloliakos, Antonios
    Psounis, Konstantinos
    Vassalos, Vasilis
    Mourtos, Ioannis
    2016 12TH ANNUAL CONFERENCE ON WIRELESS ON-DEMAND NETWORK SYSTEMS AND SERVICES (WONS), 2016, : 162 - 169
  • [44] Non-Asymptotic Gap-Dependent Regret Bounds for Tabular MDPs
    Simchowitz, Max
    Jamieson, Kevin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [45] Non-asymptotic upper bounds on the probability of the ε-atypical set for Markov chains
    Lastras-Montaño, LA
    2004 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, PROCEEDINGS, 2004, : 222 - 222
  • [46] Non-Asymptotic Bounds on Fixed Length Source Coding for Markov Chains
    Hayashi, Masahito
    Watanabe, Shun
    2013 51ST ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2013, : 875 - 882
  • [47] Non-Asymptotic Bounds on the Performance of Dual Methods for Resource Allocation Problems
    Goertzen, Simon
    Schmeink, Anke
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2014, 13 (06) : 3430 - 3441
  • [48] Guaranteed characterization of exact non-asymptotic confidence regions as defined by LSCR and SPS
    Kieffer, Michel
    Walter, Eric
    AUTOMATICA, 2014, 50 (02) : 507 - 512
  • [49] Non-asymptotic bandwidth selection for density estimation of discrete data
    Botev, Zdravko I.
    Kroese, Dirk P.
    METHODOLOGY AND COMPUTING IN APPLIED PROBABILITY, 2008, 10 (03) : 435 - 451
  • [50] Non-asymptotic Confidence Regions for the Transfer Functions of Errors-in-Variables Systems
    Khorasani, Masoud Moravej
    Weyer, Erik
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2022, 67 (05) : 2373 - 2388