Approximation and sampling of multivariate probability distributions in the tensor train decomposition

Authors
Sergey Dolgov
Karim Anaya-Izquierdo
Colin Fox
Robert Scheichl
Affiliations
[1] University of Bath
[2] University of Otago
[3] University of Heidelberg
Source
Statistics and Computing | 2020 / Vol. 30
Keywords
Multivariate distributions; Surrogate models; Tensor decomposition; MCMC; Importance weights
DOI
Not available
Abstract
General multivariate distributions are notoriously expensive to sample from, particularly the high-dimensional posterior distributions in PDE-constrained inverse problems. This paper develops a sampler for arbitrary continuous multivariate distributions that is based on low-rank surrogates in the tensor train format, a methodology that has been exploited for many years for scalable, high-dimensional density function approximation in quantum physics and chemistry. We build upon recent developments of the cross approximation algorithms in linear algebra to construct a tensor train approximation to the target probability density function using a small number of function evaluations. For sufficiently smooth distributions, the storage required for accurate tensor train approximations is moderate, scaling linearly with dimension. In turn, the structure of the tensor train surrogate allows sampling by an efficient conditional distribution method since marginal distributions are computable with linear complexity in dimension. Expected values of non-smooth quantities of interest, with respect to the surrogate distribution, can be estimated using transformed independent uniformly-random seeds that provide Monte Carlo quadrature or transformed points from a quasi-Monte Carlo lattice to give more efficient quasi-Monte Carlo quadrature. Unbiased estimates may be calculated by correcting the transformed random seeds using a Metropolis–Hastings accept/reject step, while the quasi-Monte Carlo quadrature may be corrected either by a control-variate strategy or by importance weighting. 
We show that the error in the tensor train approximation propagates linearly into the Metropolis–Hastings rejection rate and the integrated autocorrelation time of the resulting Markov chain; thus, the integrated autocorrelation time may be made arbitrarily close to 1, implying that, asymptotic in sample size, the cost per effectively independent sample is one target density evaluation plus the cheap tensor train surrogate proposal that has linear cost with dimension. These methods are demonstrated in three computed examples: fitting failure time of shock absorbers; a PDE-constrained inverse diffusion problem; and sampling from the Rosenbrock distribution. The delayed rejection adaptive Metropolis (DRAM) algorithm is used as a benchmark. In all computed examples, the importance weight-corrected quasi-Monte Carlo quadrature performs best and is more efficient than DRAM by orders of magnitude across a wide range of approximation accuracies and sample sizes. Indeed, all the methods developed here significantly outperform DRAM in all computed examples.
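The conditional distribution method and the Metropolis–Hastings correction described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a density tabulated on a discrete grid (the paper treats continuous distributions), tensor train cores stored as NumPy arrays of shape (r_prev, n, r_next) with nonnegative entries so that every conditional is a valid probability vector, and the function names `right_partial_sums`, `tt_sample`, `tt_eval` and `independence_mh` are hypothetical.

```python
import numpy as np

def right_partial_sums(cores):
    """R[k] sums the density over grid indices k..d-1 (R[d] is the scalar 1)."""
    d = len(cores)
    R = [None] * (d + 1)
    R[d] = np.ones((1, 1))
    for k in range(d - 1, -1, -1):
        # Summing a core over its grid index gives an (r_prev, r_next) matrix.
        R[k] = cores[k].sum(axis=1) @ R[k + 1]
    return R

def tt_sample(cores, seeds, R=None):
    """Map uniform seeds in [0, 1)^d to one grid point by conditional sampling.

    Returns the index tuple and its normalised probability under the surrogate.
    """
    if R is None:
        R = right_partial_sums(cores)
    left = np.ones((1, 1))  # product of the cores at the indices chosen so far
    idx, prob = [], 1.0
    for k, core in enumerate(cores):
        # Unnormalised conditional of index k given indices 0..k-1.
        w = np.einsum('a,aib,b->i', left[0], core, R[k + 1][:, 0])
        p = w / w.sum()
        # Inverse-CDF step: first index whose cumulative mass exceeds the seed.
        i = int(np.searchsorted(np.cumsum(p), seeds[k], side='right'))
        i = min(i, len(p) - 1)  # guard against round-off in the cumulative sum
        idx.append(i)
        prob *= p[i]
        left = left @ core[:, i, :]
    return tuple(idx), prob

def tt_eval(cores, idx):
    """Evaluate the unnormalised tensor train surrogate at one grid point."""
    v = np.ones((1, 1))
    for core, i in zip(cores, idx):
        v = v @ core[:, i, :]
    return float(v[0, 0])

def independence_mh(target, cores, n_steps, rng):
    """Independence Metropolis-Hastings with the surrogate as proposal.

    `target` is an unnormalised density on index tuples, assumed positive
    wherever the surrogate is positive.
    """
    d = len(cores)
    R = right_partial_sums(cores)
    x, qx = tt_sample(cores, rng.random(d), R)
    chain, accepted = [x], 0
    for _ in range(n_steps):
        y, qy = tt_sample(cores, rng.random(d), R)
        ratio = target(y) * qx / (target(x) * qy)
        if rng.random() < ratio:
            x, qx = y, qy
            accepted += 1
        chain.append(x)
    return chain, accepted / n_steps
```

Each sample costs one sweep over the d cores, which matches the linear-in-dimension cost stated in the abstract. When `target` coincides with the surrogate the acceptance ratio is 1 up to round-off, so the chain is effectively a sequence of independent draws; the quasi-Monte Carlo variants mentioned in the abstract would replace `rng.random(d)` with lattice points, which this sketch does not attempt.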
Pages: 603 - 625
Page count: 22
Related papers
50 in total
  • [41] tgEDMD: Approximation of the Kolmogorov Operator in Tensor Train Format
    Luecke, Marvin
    Nueske, Feliks
    JOURNAL OF NONLINEAR SCIENCE, 2022, 32 (04)
  • [42] Error Analysis of Tensor-Train Cross Approximation
    Qin, Zhen
    Lidiak, Alexander
    Gong, Zhexuan
    Tang, Gongguo
    Wakin, Michael B.
    Zhu, Zhihui
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [43] Tensor Train Optimization for Conformational Sampling of Organic Molecules
    Zurek, Christopher
    Mallaev, Ruslan A.
    Paul, Alexander C.
    van Staalduinen, Nils
    Pracht, Philipp
    Ellerbrock, Roman
    Bannwarth, Christoph
    JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2025, 21 (03) : 1459 - 1475
  • [44] PROBABILITY-DISTRIBUTIONS WITH GIVEN MULTIVARIATE MARGINALS
    COHEN, L
    JOURNAL OF MATHEMATICAL PHYSICS, 1984, 25 (08) : 2402 - 2403
  • [45] Discrete analogues of continuous multivariate probability distributions
    Martin Wiegand
    Saralees Nadarajah
    Yuanyuan Zhang
    Annals of Operations Research, 2020, 292 : 183 - 190
  • [46] Discrete analogues of continuous multivariate probability distributions
    Wiegand, Martin
    Nadarajah, Saralees
    Zhang, Yuanyuan
    ANNALS OF OPERATIONS RESEARCH, 2020, 292 (01) : 183 - 190
  • [47] COPULAE AND MULTIVARIATE PROBABILITY DISTRIBUTIONS IN FINANCE Preface
    Adcock, Chris
    EUROPEAN JOURNAL OF FINANCE, 2009, 15 (7-8) : 608 - 608
  • [48] Tensor Networks for Latent Variable Analysis: Novel Algorithms for Tensor Train Approximation
    Phan, Anh-Huy
    Cichocki, Andrzej
    Uschmajew, Andre
    Tichavsky, Petr
    Luta, George
    Mandic, Danilo P.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (11) : 4622 - 4636
  • [49] Distributed Non-Negative Tensor Train Decomposition
    Bhattarai, Manish
    Chennupati, Gopinath
    Skau, Erik
    Vangara, Raviteja
    Djidjev, Hirsto
    Alexandrov, Boian S.
    2020 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2020,
  • [50] Distributed and Randomized Tensor Train Decomposition for Feature Extraction
    Fonal, Krzysztof
    Zdunek, Rafal
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,