Parallel inference for big data with the group Bayesian method

被引:0
|
作者
Guangbao Guo
Guoqi Qian
Lu Lin
Wei Shao
机构
[1] Shandong University of Technology,Department of Statistics
[2] The University of Melbourne,School of Mathematics and Statistics
[3] Shandong University,School of Mathematics
[4] Qufu Normal University,School of Management
来源
Metrika | 2021年 / 84卷
关键词
Data subsets; Group Gibbs; Parallel inference; 62F15; 62J12; 62D05;
D O I
暂无
中图分类号
学科分类号
摘要
In recent years, big datasets are often split into several subsets due to the storage requirements. We propose a parallel group Bayesian method for statistical inference in sparse big data. This method improves the existing methods in two aspects: the total datasets are also split into a data subset sequence and the parameter vector is divided into several sub-vectors. Besides, we add a weight sequence to optimize the sub-estimators when each of them has a different covariance matrix. We obtain several theoretical properties of the estimator. The results of numerical simulations show that our method is consistent with the theoretical results and is more effective than classic Markov chain Monte Carlo methods.
引用
收藏
页码:225 / 243
页数:18
相关论文
共 50 条
  • [1] Parallel inference for big data with the group Bayesian method
    Guo, Guangbao
    Qian, Guoqi
    Lin, Lu
    Shao, Wei
    METRIKA, 2021, 84 (02) : 225 - 243
  • [2] Bayesian inference of multi-group nuclear data by Monte Carlo sampling method
    Wu, Qu
    Peng, Xingjie
    Yu, Yingrui
    Li, Qing
    ANNALS OF NUCLEAR ENERGY, 2021, 161
  • [3] Bayesian inference on group differences in multivariate categorical data
    Russo, Massimiliano
    Durante, Daniele
    Scarpa, Bruno
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2018, 126 : 136 - 149
  • [4] A Novel Parallel implementation of Naive Bayesian classifier for Big Data
    Katkar, Vijay D.
    Kulkarni, Siddhant Vijay
    2013 INTERNATIONAL CONFERENCE ON GREEN COMPUTING, COMMUNICATION AND CONSERVATION OF ENERGY (ICGCE), 2013, : 847 - 852
  • [5] Double-Parallel Monte Carlo for Bayesian analysis of big data
    Xue, Jingnan
    Liang, Faming
    STATISTICS AND COMPUTING, 2019, 29 (01) : 23 - 32
  • [6] Double-Parallel Monte Carlo for Bayesian analysis of big data
    Jingnan Xue
    Faming Liang
    Statistics and Computing, 2019, 29 : 23 - 32
  • [7] Parallel algorithms for Bayesian phylogenetic inference
    Feng, XZ
    Buell, DA
    Rose, JR
    Waddell, PJ
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2003, 63 (7-8) : 707 - 718
  • [8] A Variational Bayesian inference method for parametric imaging of PET data
    Castellaro, M.
    Rizzo, G.
    Tonietto, M.
    Veronese, M.
    Turkheimer, F. E.
    Chappell, M. A.
    Bertoldo, A.
    NEUROIMAGE, 2017, 150 : 136 - 149
  • [9] An Investigation of parallel road map inference from Big GPS Traces Data
    Elleuch, Wiam
    Wali, Ali
    Alimi, Adel M.
    INNS CONFERENCE ON BIG DATA 2015 PROGRAM, 2015, 53 : 131 - 140
  • [10] An Improved Bayesian Inference Method for Data-Intensive Computing
    Ma, Feng
    Liu, Weiyi
    COMPUTATIONAL INTELLIGENCE AND INTELLIGENT SYSTEMS, 2012, 316 : 134 - 144