An algorithm for distributed Bayesian inference

被引:4
|
作者
Shyamalkumar, Nariankadu D. [1 ]
Srivastava, Sanvesh [1 ]
机构
[1] Univ Iowa, Dept Stat & Actuarial Sci, Iowa City, IA 52242 USA
来源
STAT | 2022年 / 11卷 / 01期
基金
美国国家科学基金会;
关键词
data augmentation; distributed computing; divide-and-conquer; location-scatter family; Monte Carlo computations; Wasserstein distance; BARYCENTERS; MODELS;
D O I
10.1002/sta4.432
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Monte Carlo algorithms, such as Markov chain Monte Carlo (MCMC) and Hamiltonian Monte Carlo (HMC), are routinely used for Bayesian inference; however, these algorithms are prohibitively slow in massive data settings because they require multiple passes through the full data in every iteration. Addressing this problem, we develop a scalable extension of these algorithms using the divide-and-conquer (D&C) technique that divides the data into a sufficiently large number of subsets, draws parameters in parallel on the subsets using a powered likelihood and produces Monte Carlo draws of the parameter by combining parameter draws obtained from each subset. The combined parameter draws play the role of draws from the original sampling algorithm. Our main contributions are twofold. First, we demonstrate through diverse simulated and real data analyses focusing on generalized linear models (GLMs) that our distributed algorithm delivers comparable results as the current state-of-the-art D&C algorithms in terms of statistical accuracy and computational efficiency. Second, providing theoretical support for our empirical observations, we identify regularity assumptions under which the proposed algorithm leads to asymptotically optimal inference. We also provide illustrative examples focusing on normal linear and logistic regressions where parts of our D&C algorithm are analytically tractable.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Distributed bearing-only target tracking algorithm based on variational Bayesian inference under random measurement anomalies
    杨浩然
    CHEN Yu
    HU Zhentao
    JIA Haoqian
    High Technology Letters, 2025, 31 (01) : 86 - 94
  • [32] An inference algorithm for probabilistic fault management in distributed systems
    Ding, JG
    Krämer, B
    Bai, YC
    Chen, HS
    NETWORK CONTROL AND ENGINEERING FOR QOS, SECURITY AND MOBILITY, III, 2005, 165 : 193 - 204
  • [33] Simulation of Bayesian Learning and Inference on Distributed Stochastic Spiking Neural Networks
    Ahmed, Khadeer
    Shrestha, Amar
    Qiu, Qinru
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 1044 - 1051
  • [34] Macro programming through Bayesian networks: Distributed inference and anomaly detection
    Mamei, Marco
    Nagpal, Radhika
    FIFTH ANNUAL IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS, PROCEEDINGS, 2007, : 87 - +
  • [35] A Bayesian metareasoner for algorithm selection for real-time Bayesian network inference problems
    Guo, HP
    EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS, 2002, : 983 - 983
  • [36] Abductive inference in Bayesian networks using distributed overlapping swarm intelligence
    Nathan Fortier
    John Sheppard
    Shane Strasser
    Soft Computing, 2015, 19 : 981 - 1001
  • [37] Classical and Bayesian inference of Cpy for generalized Lindley distributed quality characteristic
    Saha, Mahendra
    Dey, Sanku
    Yadav, Abhimanyu Singh
    Kumar, Sumit
    QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2019, 35 (08) : 2593 - 2611
  • [38] Bayesian inference in a distributed associative neural network for adaptive signal processing
    Zeng, Qianglong
    Zeng, Ganwen
    ICINCO 2006: Proceedings of the Third International Conference on Informatics in Control, Automation and Robotics: SIGNAL PROCESSING, SYSTEMS MODELING AND CONTROL, 2006, : 177 - 181
  • [39] Scaling up Bayesian variational inference using distributed computing clusters
    Masegosa, Andres R.
    Martinez, Ana M.
    Langseth, Helge
    Nielsen, Thomas D.
    Salmeron, Antonio
    Ramos-Lopez, Dario
    Madsen, Anders L.
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2017, 88 : 435 - 451
  • [40] Distributed Bayesian Parameter Inference for Physics-Informed Neural Networks
    Bai, He
    Bhar, Kinjal
    George, Jemin
    Busart, Carl
    2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 2911 - 2916