Parallel inference for big data with the group Bayesian method

被引:0
|
作者
Guangbao Guo
Guoqi Qian
Lu Lin
Wei Shao
机构
[1] Shandong University of Technology,Department of Statistics
[2] The University of Melbourne,School of Mathematics and Statistics
[3] Shandong University,School of Mathematics
[4] Qufu Normal University,School of Management
来源
Metrika | 2021年 / 84卷
关键词
Data subsets; Group Gibbs; Parallel inference; 62F15; 62J12; 62D05;
D O I
暂无
中图分类号
学科分类号
摘要
In recent years, big datasets are often split into several subsets due to the storage requirements. We propose a parallel group Bayesian method for statistical inference in sparse big data. This method improves the existing methods in two aspects: the total datasets are also split into a data subset sequence and the parameter vector is divided into several sub-vectors. Besides, we add a weight sequence to optimize the sub-estimators when each of them has a different covariance matrix. We obtain several theoretical properties of the estimator. The results of numerical simulations show that our method is consistent with the theoretical results and is more effective than classic Markov chain Monte Carlo methods.
引用
收藏
页码:225 / 243
页数:18
相关论文
共 50 条
  • [41] Parallel Markov chain Monte Carlo for Bayesian hierarchical models with big data, in two stages
    Wei, Zheng
    Conlon, Erin M.
    JOURNAL OF APPLIED STATISTICS, 2019, 46 (11) : 1917 - 1936
  • [42] A Massively Parallel Bayesian Approach to Factorization-Based Analysis of Big Time Series Data
    Gao T.
    Liu Y.
    Tang Y.
    Zhang L.
    Chen D.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2019, 56 (07): : 1567 - 1577
  • [43] Statistical inference on survival data in group-parallel clinical trials with treatment switching
    Ding, Chang
    Tse, Siu-keung
    Yang, Ronghai
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2013, 64 : 269 - 280
  • [44] Magnitude-based Inference is not Bayesian and is not a valid method of inference
    Sainani, Kristin L.
    Lohse, Keith R.
    Jones, Paul Remy
    Vickers, Andrew
    SCANDINAVIAN JOURNAL OF MEDICINE & SCIENCE IN SPORTS, 2019, 29 (09) : 1428 - 1436
  • [45] Perspectives on Bayesian Methods and Big Data
    Greg M. Allenby
    Eric T. Bradlow
    Edward I. George
    John Liechty
    Robert E. McCulloch
    Customer Needs and Solutions, 2014, 1 (3) : 169 - 175
  • [46] Efficient statistical inference for a parallel study with missing data by using an exact method
    Shan, Guogen
    Hutson, Alan
    Wilding, Gregory E.
    Ma, Changxing
    Tian, Guo-Liang
    JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2019, 29 (03) : 478 - 490
  • [47] Parallel Bayesian inference of range and reflectance from LaDAR profiles
    Ye, Jing
    Wallace, Andrew M.
    Al Zain, Abdallah
    Thompson, John
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2013, 73 (04) : 383 - 399
  • [48] A Parallel Algorithm for Bayesian Network Inference using Arithmetic Circuits
    Vasimuddin, Md.
    Chockalingam, Sriram P.
    Aluru, Srinivas
    2018 32ND IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2018, : 34 - 43
  • [49] A Robustified Posterior for Bayesian Inference on a Large Number of Parallel Effects
    Liao, J. G.
    Berg, Arthur
    McMurry, Timothy L.
    AMERICAN STATISTICIAN, 2021, 75 (02): : 145 - 151
  • [50] Parallel probabilistic graphical model approach for nonparametric Bayesian inference
    Lee, Wonjung
    Zabaras, Nicholas
    JOURNAL OF COMPUTATIONAL PHYSICS, 2018, 372 : 546 - 563