Nonconvex sampling with the Metropolis-adjusted Langevin algorithm

被引:0
|
作者
Mangoubi, Oren [1 ]
Vishnoi, Nisheeth K. [2 ]
机构
[1] Ecole Polytech Federale Lausanne EPFL, Lausanne, Switzerland
[2] Yale Univ, New Haven, CT USA
来源
基金
瑞士国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Langevin Markov chain algorithms are widely deployed methods to sample from distributions in challenging high-dimensional and non-convex statistics and machine learning applications. Despite this, current bounds for the Langevin algorithms are worse than those of competing algorithms in many important situations, for instance when sampling from weakly log-concave distributions, or when sampling or optimizing non-convex log-densities. We obtain improved bounds in many of these situations, showing that the Metropolis-adjusted Langevin algorithm (MALA) is faster than the best bounds for its competitor algorithms when the target distribution satisfies weak third- and fourth- order regularity properties associated with the input data. In many settings, our regularity conditions are weaker than the usual Euclidean operator norm regularity properties, allowing us to show faster bounds for a much larger class of distributions than would be possible with the usual Euclidean operator norm approach, including in statistics and machine learning applications where the data satisfy a certain incoherence condition. In particular, we show that using our regularity conditions one can obtain faster bounds for applications which include sampling problems in Bayesian logistic regression with weakly convex priors, and the nonconvex optimization problem of learning linear classifiers with zero-one loss functions. Our main technical contribution is an analysis of the Metropolis acceptance probability of MALA in terms of its "energy-conservation error," and a bound for this error in terms of third- and fourth- order regularity conditions. The combination of this higher-order analysis of the energy conservation error with the conductance method is key to obtaining bounds which have a sub-linear dependence on the dimension d in the non-strongly logconcave setting.
引用
收藏
页数:35
相关论文
共 50 条
  • [1] Langevin diffusions and the Metropolis-adjusted Langevin algorithm
    Xifara, T.
    Sherlock, C.
    Livingstone, S.
    Byrne, S.
    Girolami, M.
    [J]. STATISTICS & PROBABILITY LETTERS, 2014, 91 : 14 - 19
  • [2] Minimax Mixing Time of the Metropolis-Adjusted Langevin Algorithm for Log-Concave Sampling
    Wu, Keru
    Schmidler, Scott
    Chen, Yuansi
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2022, 23 : 1 - 63
  • [3] Minimax Mixing Time of the Metropolis-Adjusted Langevin Algorithm for Log-Concave Sampling
    Wu, Keru
    Schmidler, Scott
    Chen, Yuansi
    [J]. Journal of Machine Learning Research, 2022, 23
  • [4] Particle Metropolis-adjusted Langevin algorithms
    Nemeth, Christopher
    Sherlock, Chris
    Fearnhead, Paul
    [J]. BIOMETRIKA, 2016, 103 (03) : 701 - 717
  • [5] On the Computational Complexity of Metropolis-Adjusted Langevin Algorithms for Bayesian Posterior Sampling
    Tang, Rong
    Yang, Yun
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25 : 1 - 79
  • [6] On geometric convergence for the Metropolis-adjusted Langevin algorithm under simple conditions
    Oliviero-Durmus, Alain
    Moulines, Eric
    [J]. BIOMETRIKA, 2024, 111 (01) : 273 - 289
  • [7] GAUSSIAN APPROXIMATIONS OF SDES IN METROPOLIS-ADJUSTED LANGEVIN ALGORITHMS
    Sarkka, Simo
    Merkatas, Christos
    Karvonen, Toni
    [J]. 2021 IEEE 31ST INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2021,
  • [8] A neural network assisted Metropolis adjusted Langevin algorithm
    Mueller, Christian
    Diedam, Holger
    Mrziglod, Thomas
    Schuppert, Andreas
    [J]. MONTE CARLO METHODS AND APPLICATIONS, 2020, 26 (02): : 93 - 111
  • [9] An Adaptive Version for the Metropolis Adjusted Langevin Algorithm with a Truncated Drift
    Yves F. Atchadé
    [J]. Methodology and Computing in Applied Probability, 2006, 8 : 235 - 254
  • [10] An adaptive version for the Metropolis Adjusted Langevin algorithm with a truncated drift
    Atchade, Yves F.
    [J]. METHODOLOGY AND COMPUTING IN APPLIED PROBABILITY, 2006, 8 (02) : 235 - 254