We study a stochastically perturbed version of the well-known Krasnoselskii--Mann iteration for computing fixed points of nonexpansive maps in finite dimensional normed spaces. We discuss sufficient conditions on the stochastic noise and stepsizes that guarantee almost sure convergence of the iterates towards a fixed point and derive nonasymptotic error bounds and convergence rates for the fixed-point residuals. Our main results concern the case of a martingale difference noise with variances that can possibly grow unbounded. This supports an application to reinforcement learning for average reward Markov decision processes, for which we establish convergence and asymptotic rates. We also analyze in depth the case where the noise has uniformly bounded variance, obtaining error bounds with explicit computable constants.
机构:
Zhejiang Ocean Univ, Sch Math Phys & Informat Sci, Zhoushan 316004, Peoples R China
Zhejiang Normal Univ, Coll Math Phys & Informat Engn, Jinhua 321004, Peoples R ChinaZhejiang Ocean Univ, Sch Math Phys & Informat Sci, Zhoushan 316004, Peoples R China
Hao, Yan
Cho, Sun Young
论文数: 0引用数: 0
h-index: 0
机构:
Gyeongsang Natl Univ, Dept Math, Jinju 660701, South KoreaZhejiang Ocean Univ, Sch Math Phys & Informat Sci, Zhoushan 316004, Peoples R China