Adaptive Strategy for Stratified Monte Carlo Sampling

被引:0
|
作者
Carpentier, Alexandra [1 ]
Munos, Remi [2 ,4 ]
Antos, Andras [3 ]
机构
[1] Ctr Math Sci, Stat Lab, Wilberforce Rd, Cambridge CB3 0WB, England
[2] Google DeepMind, London, England
[3] Budapest Univ Technol & Econ, H-1111 Budapest, Hungary
[4] Inria Lille Nord Europe, Lille, France
关键词
adaptive sampling; bandit theory; stratified Monte Carlo; minimax strategies; active learning; ALLOCATION;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the problem of stratified sampling for Monte Carlo integration of a random variable. We model this problem in a K-armed bandit, where the arms represent the K strata. The goal is to estimate the integral mean, that is a weighted average of the mean values of the arms. The learner is allowed to sample the variable n times, but it can decide on-line which stratum to sample next. We propose an UCB-type strategy that samples the arms according to an upper bound on their estimated standard deviations. We compare its performance to an ideal sample allocation that knows the standard deviations of the arms. For sub-Gaussian arm distributions, we provide bounds on the total regret: a distribution-dependent bound of order poly(lambda(-1)(min))(O) over tilde (n(-3/2))(1) that depends on a measure of the disparity lambda(min) of the per stratum variances and a distribution-free bound poly(K)(O) over tilde (n(-7/6)) that does not. We give similar, but somewhat sharper bounds on a proxy of the regret. The problem-independent bound for this proxy matches its recent minimax lower bound in terms of n up to a log n factor.
引用
收藏
页码:2231 / 2271
页数:41
相关论文
共 50 条
  • [1] Adaptive strategy for stratified Monte Carlo sampling
    Carpentier, Alexandra
    Munos, Remi
    Antosy, András
    [J]. Journal of Machine Learning Research, 2015, 16 : 2231 - 2271
  • [2] MULTIDIMENSIONAL MONTE-CARLO QUADRATURE WITH ADAPTIVE STRATIFIED SAMPLING
    GALLAHER, LJ
    [J]. COMMUNICATIONS OF THE ACM, 1973, 16 (01) : 49 - 50
  • [3] IMPLEMENTATION OF STRATIFIED SAMPLING FOR MONTE-CARLO APPLICATIONS
    BROWN, RS
    HENDRICKS, JS
    [J]. NUCLEAR SCIENCE AND ENGINEERING, 1987, 97 (03) : 245 - 248
  • [4] Adaptive Bayesian Sampling with Monte Carlo EM
    Roychowdhury, Anirban
    Parthasarathy, Srinivasan
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [5] Estimation of distributions via multilevel Monte Carlo with stratified sampling
    Taverniers, Soren
    Tartakovsky, Daniel M.
    [J]. JOURNAL OF COMPUTATIONAL PHYSICS, 2020, 419
  • [6] Adaptive stratified importance sampling: hybridization of extrapolation and importance sampling Monte Carlo methods for estimation of wind turbine extreme loads
    Graf, Peter
    Dykes, Katherine
    Damiani, Rick
    Jonkman, Jason
    Veers, Paul
    [J]. WIND ENERGY SCIENCE, 2018, 3 (02) : 475 - 487
  • [7] Adaptive stratified Monte Carlo algorithm for numerical computation of integrals
    Sayah, Toni
    [J]. MATHEMATICS AND COMPUTERS IN SIMULATION, 2019, 157 (143-158) : 143 - 158
  • [8] Adaptive KLD Sampling Based Monte Carlo Localization
    Sun Dihua
    Qin Ha
    Zhao Min
    Cheng Senlin
    Yang Liangyi
    [J]. PROCEEDINGS OF THE 30TH CHINESE CONTROL AND DECISION CONFERENCE (2018 CCDC), 2018, : 4154 - 4159
  • [9] An approximate Monte Carlo adaptive importance sampling method
    Booth, TE
    [J]. NUCLEAR SCIENCE AND ENGINEERING, 2001, 138 (01) : 96 - 103
  • [10] ADAPTIVE MONTE CARLO SAMPLING GRADIENT METHOD FOR OPTIMIZATION
    Tan, Hui
    [J]. 2017 WINTER SIMULATION CONFERENCE (WSC), 2017, : 4596 - 4597