Monte Carlo Gradient Estimation in Machine Learning

Cited by: 2
Authors
Mohamed, Shakir [1]
Rosca, Mihaela [1, 2]
Figurnov, Michael [1]
Mnih, Andriy [1]
Affiliations
[1] DeepMind, London, England
[2] UCL, London, England
Keywords
gradient estimation; Monte Carlo; sensitivity analysis; score-function estimator; pathwise estimator; measure-valued estimator; variance reduction; perturbation analysis; automatic differentiation; approximation; optimization; inference
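DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract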
This paper is a broad and accessible survey of the methods available for Monte Carlo gradient estimation in machine learning and across the statistical sciences: the problem of computing the gradient of an expectation of a function with respect to the parameters of the distribution being integrated, also known as the problem of sensitivity analysis. In machine learning research, this gradient problem lies at the core of many learning problems in supervised, unsupervised, and reinforcement learning. We will generally seek to rewrite such gradients in a form that allows for Monte Carlo estimation, so that they can be easily and efficiently used and analysed. We explore three strategies (the pathwise, score-function, and measure-valued gradient estimators), covering their historical development, derivation, and underlying assumptions. We describe their use in other fields, show how they are related and can be combined, and expand on their possible generalisations. Wherever Monte Carlo gradient estimators have been derived and deployed in the past, important advances have followed. A deeper and more widely held understanding of this problem will lead to further advances, and it is these advances that we wish to support.
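As a rough illustration of two of the strategies named in the abstract, the sketch below estimates the gradient of E[f(x)] with respect to the mean mu of a Gaussian N(mu, sigma^2), using a score-function estimator and a pathwise (reparameterisation) estimator. The test function f(x) = x^2, the Gaussian choice, the sample size, and all function names are assumptions made for this example and are not taken from the paper. For this choice the exact gradient is 2 * mu, which gives a reference value to compare against. The measure-valued estimator is not shown.

```python
import numpy as np


def score_function_grad(mu, sigma, f, n, rng):
    """Score-function estimate of d/dmu E_{x ~ N(mu, sigma^2)}[f(x)]."""
    x = rng.normal(mu, sigma, size=n)
    # Score of the Gaussian w.r.t. its mean: d/dmu log N(x; mu, sigma^2).
    score = (x - mu) / sigma**2
    return np.mean(f(x) * score)


def pathwise_grad(mu, sigma, df, n, rng):
    """Pathwise estimate of the same gradient; needs the derivative df of f."""
    eps = rng.standard_normal(n)
    x = mu + sigma * eps  # sampling path, with dx/dmu = 1
    return np.mean(df(x))


# Hypothetical settings chosen only for illustration.
rng = np.random.default_rng(0)
mu, sigma, n = 1.5, 1.0, 200_000

sf_estimate = score_function_grad(mu, sigma, lambda x: x**2, n, rng)
pw_estimate = pathwise_grad(mu, sigma, lambda x: 2.0 * x, n, rng)
exact = 2.0 * mu  # since E[x^2] = mu^2 + sigma^2

print("score-function:", sf_estimate)
print("pathwise:      ", pw_estimate)
print("exact:         ", exact)
```

Both estimators are unbiased in this setting, but for this particular f the pathwise estimate typically has a much lower variance. The score-function estimator needs only evaluations of f, whereas the pathwise estimator requires f to be differentiable.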
Pages: 62