A STOCHASTIC NEWTON MCMC METHOD FOR LARGE-SCALE STATISTICAL INVERSE PROBLEMS WITH APPLICATION TO SEISMIC INVERSION

Cited by: 286
Authors
Martin, James [1 ]
Wilcox, Lucas C. [2 ]
Burstedde, Carsten
Ghattas, Omar [3 ,4 ]
Affiliations
[1] Univ Texas Austin, Inst Computat Engn & Sci, Computat Sci Engn & Math Grad Program, Austin, TX 78712 USA
[2] USN, Postgrad Sch, Dept Appl Math, Monterey, CA 93943 USA
[3] Univ Texas Austin, Inst Computat Engn & Sci, Jackson Sch Geosci, Austin, TX 78712 USA
[4] Univ Texas Austin, Dept Mech Engn, Austin, TX 78712 USA
Source
SIAM JOURNAL ON SCIENTIFIC COMPUTING | 2012 / Vol. 34 / No. 3
Funding
National Science Foundation (USA);
Keywords
MCMC; Stochastic Newton; inverse problems; uncertainty quantification; Langevin dynamics; low-rank Hessian; MODEL-REDUCTION; POSTERIOR; CALIBRATION; ALGORITHMS; LANGEVIN;
DOI
10.1137/110845598
Chinese Library Classification number
O29 [Applied Mathematics];
Discipline classification code
070104 ;
Abstract
We address the solution of large-scale statistical inverse problems in the framework of Bayesian inference. The Markov chain Monte Carlo (MCMC) method is the most popular approach for sampling the posterior probability distribution that describes the solution of the statistical inverse problem. MCMC methods face two central difficulties when applied to large-scale inverse problems: first, the forward models (typically in the form of partial differential equations) that map uncertain parameters to observable quantities make the evaluation of the probability density at any point in parameter space very expensive; and second, the high-dimensional parameter spaces that arise upon discretization of infinite-dimensional parameter fields make the exploration of the probability density function prohibitive. The challenge for MCMC methods is to construct proposal functions that simultaneously provide a good approximation of the target density while being inexpensive to manipulate. Here we present a so-called Stochastic Newton method in which MCMC is accelerated by constructing and sampling from a proposal density that builds a local Gaussian approximation based on local gradient and Hessian (of the log posterior) information. Thus, the method exploits tools (adjoint-based gradients and Hessians) that have been instrumental for fast (often mesh-independent) solution of deterministic inverse problems. Hessian manipulations (inverse, square root) are made tractable by a low-rank approximation that exploits the compact nature of the data misfit operator. This is analogous to a reduced model of the parameter-to-observable map. The method is applied to the Bayesian solution of an inverse medium problem governed by 1D seismic wave propagation. We compare the Stochastic Newton method with a reference black box MCMC method as well as a gradient-based Langevin MCMC method, and observe at least two orders of magnitude improvement in convergence for problems with up to 65 parameters. 
Numerical evidence suggests that a 1025 parameter problem converges at the same rate as the 65 parameter problem.
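The proposal construction described in the abstract can be illustrated with a minimal sketch. It assumes a toy two-parameter target whose negative log posterior is V(m) = 0.5 m^T A m + 0.25 sum(m^4), so the gradient and Hessian are available in closed form and the Hessian is always positive definite. The function names, the matrix A, and the dense Cholesky factorization are illustrative assumptions; the paper itself relies on adjoint-based derivatives and a low-rank Hessian approximation, neither of which is reproduced here.

```python
import numpy as np

# Toy negative log posterior (an assumption for illustration, not the paper's model).
def neg_log_post(m, A):
    return 0.5 * m @ A @ m + 0.25 * np.sum(m**4)

def grad(m, A):
    return A @ m + m**3

def hess(m, A):
    # A is symmetric positive definite, so this Hessian is too.
    return A + np.diag(3.0 * m**2)

def log_q(x_from, x_to, A):
    """Log proposal density (up to a constant) of moving x_from -> x_to.

    The proposal is the local Gaussian N(x_from - H^{-1} g, H^{-1}),
    with g and H evaluated at x_from.
    """
    H = hess(x_from, A)
    mean = x_from - np.linalg.solve(H, grad(x_from, A))
    r = x_to - mean
    _, logdet = np.linalg.slogdet(H)
    return 0.5 * logdet - 0.5 * r @ H @ r

def stochastic_newton(m0, A, n_steps, rng):
    m = np.array(m0, dtype=float)
    samples = np.empty((n_steps, m.size))
    accepted = 0
    for k in range(n_steps):
        H = hess(m, A)
        mean = m - np.linalg.solve(H, grad(m, A))  # Newton step from m
        L = np.linalg.cholesky(H)                  # H = L L^T
        # Solving L^T z = xi gives cov(z) = (L L^T)^{-1} = H^{-1}.
        y = mean + np.linalg.solve(L.T, rng.standard_normal(m.size))
        # Metropolis-Hastings correction for the state-dependent proposal.
        log_alpha = (-neg_log_post(y, A) + log_q(y, m, A)) \
                  - (-neg_log_post(m, A) + log_q(m, y, A))
        if np.log(rng.uniform()) < log_alpha:
            m = y
            accepted += 1
        samples[k] = m
    return samples, accepted / n_steps

if __name__ == "__main__":
    A = np.array([[2.0, 0.5], [0.5, 1.0]])
    rng = np.random.default_rng(0)
    samples, rate = stochastic_newton([2.0, -2.0], A, 5000, rng)
    print("acceptance rate:", rate)
    print("sample mean:", samples.mean(axis=0))
```

For an exactly Gaussian target the local approximation coincides with the target and every proposal would be accepted; the quartic term is included here so that the accept/reject step is actually exercised.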
Pages: A1460 / A1487
Number of pages: 28