ROBUST, SPARSE AND SCALABLE INFERENCE USING BOOTSTRAP AND VARIABLE SELECTION FUSION

被引:0
|
作者
Mozafari-Majd, Emadaldin [1 ]
Koivunen, Visa [1 ]
机构
[1] Aalto Univ, Dept Signal Proc & Acoust, POB 15400, FU-0076 Aalto, Finland
基金
芬兰科学院;
关键词
statistical inference; robust; bootstrap; sparsity; high-dimensional; large-scale; REGRESSION; RIDGE;
D O I
10.1109/camsap45676.2019.9022472
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we address the challenging problem of conducting statistical inference for large-scale data sets in the presence of sparsity and outlying observations. In particular, processing and storing such data on a single computing node may be infeasible due to its high volume and dimensionality. Therefore, the large-scale data is subdivided into smaller distinct subsets that may be stored and processed in different nodes. We propose a robust and scalable statistical inference method using a two-stage algorithm where variable selection is performed via fusing the selected support from each distinct subset of data. The actual parameter and confidence interval estimation takes place in the second stage using a robust extension of Bag of Little Bootstraps (BLB) technique. In order to exploit sparsity and ensure robustness, MM-Lasso estimator is used to select variables for each subset of data. The selections are then fused to find the support for the original large-scale data. In the second stage, the robust MM-estimator is used for the selected support. The simulation studies demonstrated the highly reliable performance of the algorithm in variable selection and providing reliable confidence intervals even if the estimation problem in the subsets is slightly under-determined.
引用
收藏
页码:271 / 275
页数:5
相关论文
共 50 条
  • [21] UNIFORM ASYMPTOTIC INFERENCE AND THE BOOTSTRAP AFTER MODEL SELECTION
    Tibshirani, Ryan J.
    Rinaldo, Alessandro
    Tibshirani, Rob
    Wasserman, Larry
    [J]. ANNALS OF STATISTICS, 2018, 46 (03): : 1255 - 1287
  • [22] Bayesian robust inference of sample selection using selection-t models
    Ding, Peng
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2014, 124 : 451 - 464
  • [23] Inference after variable selection using restricted permutation methods
    Wang, Rui
    Lagakos, Stephen W.
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2009, 37 (04): : 625 - 644
  • [24] Variable Selection in Causal Inference using a Simultaneous Penalization Method
    Ertefaie, Ashkan
    Asgharian, Masoud
    Stephens, David A.
    [J]. JOURNAL OF CAUSAL INFERENCE, 2018, 6 (01)
  • [25] BOOTSTRAP SELECTION PROCEDURES BASED ON ROBUST ESTIMATORS
    SWANEPOEL, JWH
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 1983, 12 (18) : 2059 - 2083
  • [26] Scalable and robust sparse subspace clustering using randomized clustering and multilayer graphs
    Abdolali, Maryam
    Gillis, Nicolas
    Rahmati, Mohammad
    [J]. SIGNAL PROCESSING, 2019, 163 : 166 - 180
  • [27] A Robust Variable Selection Method for Sparse Online Regression via the Elastic Net Penalty
    Wang, Wentao
    Liang, Jiaxuan
    Liu, Rong
    Song, Yunquan
    Zhang, Min
    [J]. MATHEMATICS, 2022, 10 (16)
  • [28] Robust model selection using the out-of-bag bootstrap in linear regression
    Fazli Rabbi
    Alamgir Khalil
    Ilyas Khan
    Muqrin A. Almuqrin
    Umair Khalil
    Mulugeta Andualem
    [J]. Scientific Reports, 12
  • [29] Bootstrap inference for instrumental variable models with many weak instruments
    Wang, Wenjie
    Kaffo, Maximilien
    [J]. JOURNAL OF ECONOMETRICS, 2016, 192 (01) : 231 - 268
  • [30] Robust model selection using the out-of-bag bootstrap in linear regression
    Rabbi, Fazli
    Khalil, Alamgir
    Khan, Ilyas
    Almuqrin, Muqrin A.
    Khalil, Umair
    Andualem, Mulugeta
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)