The proximal Robbins-Monro method

被引:7
|
作者
Toulis, Panos [1 ]
Horel, Thibaut [2 ]
Airoldi, Edoardo M. [3 ]
机构
[1] Univ Chicago, Booth Sch Business, Chicago, IL 60637 USA
[2] Harvard Univ, Dept Comp Sci, Cambridge, MA 02138 USA
[3] Temple Univ, Fox Sch Business, Philadelphia, PA 19122 USA
基金
美国国家科学基金会;
关键词
implicit updates; iterative estimation; proximal operators; stochastic approximation; stochastic fixed‐ point equations; stochastic gradient descent; ALGORITHM; ROBUST;
D O I
10.1111/rssb.12405
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The need for statistical estimation with large data sets has reinvigorated interest in iterative procedures and stochastic optimization. Stochastic approximations are at the forefront of this recent development as they yield procedures that are simple, general and fast. However, standard stochastic approximations are often numerically unstable. Deterministic optimization, in contrast, increasingly uses proximal updates to achieve numerical stability in a principled manner. A theoretical gap has thus emerged. While standard stochastic approximations are subsumed by the framework Robbins and Monro (The annals of mathematical statistics, 1951, pp. 400-407), there is no such framework for stochastic approximations with proximal updates. In this paper, we conceptualize a proximal version of the classical Robbins-Monro procedure. Our theoretical analysis demonstrates that the proposed procedure has important stability benefits over the classical Robbins-Monro procedure, while it retains the best known convergence rates. Exact implementations of the proximal Robbins-Monro procedure are challenging, but we show that approximate implementations lead to procedures that are easy to implement, and still dominate standard procedures by achieving numerical stability, practically without trade-offs. Moreover, approximate proximal Robbins-Monro procedures can be applied even when the objective cannot be calculated analytically, and so they generalize stochastic proximal procedures currently in use.
引用
收藏
页码:188 / 212
页数:25
相关论文
共 50 条