Trust-region variational inference with gaussian mixture models

被引:0
|
作者
Arenz, Oleg [1 ]
Zhong, Mingjun [2 ]
Neumann, Gerhard [3 ,4 ]
机构
[1] Intelligent Autonomous Systems, Technische Universität Darmstadt, Hochschulstraße 10, Darmstadt,64289, Germany
[2] Department of Computing Science, University of Aberdeen, King's College, Aberdeen,AB24 3FX, United Kingdom
[3] Autonomous Learning Robots, Karlsruhe Institute of Technology, Adenauerring 4, Karlsruhe,76131, Germany
[4] Bosch Center for Artificial Intelligence, Robert-Bosch-Campus 1, Renningen,71272, Germany
关键词
Probability distributions;
D O I
暂无
中图分类号
学科分类号
摘要
Many methods for machine learning rely on approximate inference from intractable probability distributions. Variational inference approximates such distributions by tractable models that can be subsequently used for approximate inference. Learning sufficiently accurate approximations requires a rich model family and careful exploration of the relevant modes of the target distribution. We propose a method for learning accurate GMM approximations of intractable probability distributions based on insights from policy search by using information-geometric trust regions for principled exploration. For efficient improvement of the GMM approximation, we derive a lower bound on the corresponding optimization objective enabling us to update the components independently. Our use of the lower bound ensures convergence to a stationary point of the original objective. The number of components is adapted online by adding new components in promising regions and by deleting components with negligible weight. We demonstrate on several domains that we can learn approximations of complex, multimodal distributions with a quality that is unmet by previous variational inference methods, and that the GMM approximation can be used for drawing samples that are on par with samples created by state-of-theart MCMC samplers while requiring up to three orders of magnitude less computational resources. © 2020 Oleg Arenz, Mingjun Zhong and Gerhard Neumann.
引用
收藏
相关论文
共 50 条
  • [31] Stochastic optimization using a trust-region method and random models
    Chen, R.
    Menickelly, M.
    Scheinberg, K.
    MATHEMATICAL PROGRAMMING, 2018, 169 (02) : 447 - 487
  • [32] Stochastic optimization using a trust-region method and random models
    R. Chen
    M. Menickelly
    K. Scheinberg
    Mathematical Programming, 2018, 169 : 447 - 487
  • [33] Variational Bayes inference of spatial mixture models for segmentation
    Woolrich, Mark W.
    Behrens, Timothy E.
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2006, 25 (10) : 1380 - 1391
  • [34] Streaming Variational Inference for Bayesian Nonparametric Mixture Models
    Tank, Alex
    Foti, Nicholas J.
    Fox, Emily B.
    ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 38, 2015, 38 : 968 - 976
  • [35] Bayesian Estimation of Beta Mixture Models with Variational Inference
    Ma, Zhanyu
    Leijon, Arne
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (11) : 2160 - 2173
  • [36] Second-Order Trust-Region Optimization for Data-Limited Inference
    Ranganath, Aditya
    DeGuchy, Omar
    Singhal, Mukesh
    Marcia, Roummel E.
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 2059 - 2063
  • [37] GPGPU Implementation of Variational Bayesian Gaussian Mixture Models
    Nishimoto, Hiroki
    Zhang, Renyuan
    Nakashima, Yasuhiko
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2022, E105D (03) : 611 - 622
  • [38] Variational Gaussian Mixture Models for Speech Emotion Recognition
    Mishra, Harendra Kumar
    Sekhar, C. Chandra
    ICAPR 2009: SEVENTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION, PROCEEDINGS, 2009, : 183 - 186
  • [39] GPGPU Implementation of Variational Bayesian Gaussian Mixture Models
    Nishimoto, Hiroki
    Nakada, Takashi
    Nakashima, Yasuhiko
    2019 SEVENTH INTERNATIONAL SYMPOSIUM ON COMPUTING AND NETWORKING (CANDAR 2019), 2019, : 185 - 190
  • [40] Variational bayesian feature selection for Gaussian mixture models
    Valente, F
    Wellekens, C
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 513 - 516