Robust distributed estimation and variable selection for massive datasets via rank regression

被引:1
|
作者
Luan, Jiaming [1 ]
Wang, Hongwei [1 ]
Wang, Kangning [1 ]
Zhang, Benle [1 ]
机构
[1] Shandong Technol & Business Univ, 191 Binhai Middle Rd, Yantai 264005, Peoples R China
关键词
Massive data; Robustness; Communication efficient; Variable selection;
D O I
10.1007/s10463-021-00803-5
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Rank regression is a robust modeling tool; it is challenging to implement it for the distributed massive data owing to memory constraints. In practice, the massive data may be distributed heterogeneously from machine to machine; how to incorporate the heterogeneity is also an interesting issue. This paper proposes a distributed rank regression (DR2), which can be implemented in the master machine by solving a weighted least-squares and adaptive when the data are heterogeneous. Theoretically, we prove that the resulting estimator is statistically as efficient as the global rank regression estimator. Furthermore, based on the adaptive LASSO and a newly defined distributed BIC-type tuning parameter selector, we propose a distributed regularized rank regression (DR3), which can make consistent variable selection and can also be easily implemented by using the LARS algorithm on the master machine. Simulation results and real data analysis are included to validate our method.
引用
收藏
页码:435 / 450
页数:16
相关论文
共 50 条
  • [31] Distributed smoothed rank regression with heterogeneous errors for massive data
    Xiaohui Yuan
    Xinran Zhang
    Yue Wang
    Chunjie Wang
    [J]. Journal of the Korean Statistical Society, 2023, 52 : 1078 - 1103
  • [32] ROBUST ESTIMATION OF REDUCED RANK MODELS TO LARGE SPATIAL DATASETS
    Jelsema, Casey M.
    Paul, Rajib
    McKean, Joseph W.
    [J]. REVSTAT-STATISTICAL JOURNAL, 2020, 18 (02) : 203 - 221
  • [33] Distributed smoothed rank regression with heterogeneous errors for massive data
    Yuan, Xiaohui
    Zhang, Xinran
    Wang, Yue
    Wang, Chunjie
    [J]. JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2023, 52 (04) : 1078 - 1103
  • [34] Variable selection in rank regression for analyzing longitudinal data
    Fu, Liya
    Wang, You-Gan
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2018, 27 (08) : 2447 - 2458
  • [35] ROBUST CRITERION FOR VARIABLE SELECTION IN LINEAR REGRESSION
    Patil, A. B.
    Kashid, D. N.
    [J]. INTERNATIONAL JOURNAL OF AGRICULTURAL AND STATISTICAL SCIENCES, 2009, 5 (02): : 509 - 521
  • [36] Resampling methods for variable selection in robust regression
    Wisnowski, JW
    Simpson, JR
    Montgomery, DC
    Runger, GC
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2003, 43 (03) : 341 - 355
  • [37] Robust variable selection in the logistic regression model
    Jiang, Yunlu
    Zhang, Jiantao
    Huang, Yingqiang
    Zou, Hang
    Huang, Meilan
    Chen, Fanhong
    [J]. HACETTEPE JOURNAL OF MATHEMATICS AND STATISTICS, 2021, 50 (05): : 1572 - 1582
  • [38] Robust adaptive model selection and estimation for partial linear varying coefficient models in rank regression
    Sun, Xiaofei
    Wang, Kangning
    Lin, Lu
    [J]. JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2018, 47 (01) : 54 - 65
  • [39] Robust adaptive model selection and estimation for partial linear varying coefficient models in rank regression
    Xiaofei Sun
    Kangning Wang
    Lu Lin
    [J]. Journal of the Korean Statistical Society, 2018, 47 : 54 - 65
  • [40] Weighted LAD-LASSO method for robust parameter estimation and variable selection in regression
    Arslan, Olcay
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2012, 56 (06) : 1952 - 1965