Recursive Quantile Estimation: Non-Asymptotic Confidence Bounds

被引:0
|
作者
Chen, Likai [1 ]
Keilbar, Georg [2 ]
Wu, Wei Biao [3 ]
机构
[1] Washington Univ, Dept Math & Stat, St Louis, MO 63130 USA
[2] Univ Vienna, Dept Stat & Operat Res, Vienna, Austria
[3] Univ Chicago, Dept Stat, Chicago, IL USA
关键词
Finite sample bounds; quantiles; stochastic gradient descent; Polyak-Ruppert averaging; recursive estimation; STOCHASTIC-APPROXIMATION; HILBERT-SPACES; REGRESSION; INFERENCE; RISK;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper considers the recursive estimation of quantiles using the stochastic gradient descent (SGD) algorithm with Polyak-Ruppert averaging. The algorithm offers a computationally and memory efficient alternative to the usual empirical estimator. Our focus is on studying the non-asymptotic behavior by providing exponentially decreasing tail probability bounds under mild assumptions on the smoothness of the density functions. This novel non-asymptotic result is based on a bound of the moment generating function of the SGD estimate. We apply our result to the problem of best arm identification in a multi-armed stochastic bandit setting under quantile preferences.
引用
收藏
页数:25
相关论文
共 50 条