Estimation and inference in sparse multivariate regression and conditional Gaussian graphical models under an unbalanced distributed setting

被引:0
|
作者
Nezakati, Ensiyeh [1 ]
Pircalabelu, Eugen [1 ]
机构
[1] UCLouvain, Inst Stat Biostat & Actuarial Sci, Voie Roman Pays 20, B-1348 Louvain La Neuve, Belgium
来源
ELECTRONIC JOURNAL OF STATISTICS | 2024年 / 18卷 / 01期
关键词
Multivariate regression models; conditional Gaus- sian graphical models; debiased estimation; precision matrix; sparsity; un- balanced distributed setting; PRECISION MATRIX ESTIMATION; COVARIANCE ESTIMATION; CONFIDENCE-INTERVALS; SELECTION; CONVERGENCE; GENOMICS; LASSO; RATES;
D O I
10.1214/23-EJS2193
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This paper proposes a distributed estimation and inferential framework for sparse multivariate regression and conditional Gaussian graphical models under the unbalanced splitting setting. This type of data splitting arises when the datasets from different sources cannot be aggregated on one single machine or when the available machines are of different powers. In this paper, the number of covariates, responses and machines grow with the sample size, while sparsity is imposed. Debiased estimators of the coefficient matrix and of the precision matrix are proposed on every single machine and theoretical guarantees are provided. Moreover, new aggregated estimators that pool information across the machines using a pseudo log -likelihood function are proposed. It is shown that they enjoy efficiency and asymptotic normality as the number of machines grows with the sample size. The performance of these estimators is investigated via a simulation study and a real data example. It is shown empirically that the performances of these estimators are close to those of the non -distributed estimators which use the entire dataset.
引用
收藏
页码:599 / 652
页数:54
相关论文
共 50 条