Better subset regression

被引:5
|
作者
Xiong, Shifeng [1 ]
机构
[1] Chinese Acad Sci, Acad Math & Syst Sci, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Best subset regression; Combinatorial optimization; em algorithm; Orthogonal design; Sure screening property; Variable selection; NONCONCAVE PENALIZED LIKELIHOOD; VARIABLE SELECTION; SUPERSATURATED DESIGNS; NONNEGATIVE GARROTE; NP-DIMENSIONALITY; ORACLE PROPERTIES; BOUND ALGORITHM; EM ALGORITHM; BRANCH; LASSO;
D O I
10.1093/biomet/ast041
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
This paper studies the relationship between model fitting and screening performance to find efficient screening methods for high-dimensional linear regression models. Under a sparsity assumption we show in a general asymptotic setting that a subset that includes the true submodel always yields a smaller residual sum of squares than those that do not. To seek such a subset, we consider the optimization problem associated with best subset regression. An em algorithm, known as orthogonalizing subset screening, and its accelerated version are proposed for searching for the best subset. Although the algorithms do not always find the best subset, their monotonicity makes the subset fit the data better than initial subsets, and thus the subset can have better screening performance asymptotically. Simulation results show that our methods are very competitive.
引用
收藏
页码:71 / 84
页数:14
相关论文
共 50 条
  • [31] Symbolic regression for better specification
    Tsionas, Mike G.
    Assaf, A. George
    INTERNATIONAL JOURNAL OF HOSPITALITY MANAGEMENT, 2020, 91
  • [32] Bootstrap choice of cost complexity for better subset selection
    Rao, JS
    STATISTICA SINICA, 1999, 9 (01) : 273 - 287
  • [33] Bigger and better subset-sum-distinct sets
    Maltby, R
    MATHEMATIKA, 1997, 44 (87) : 56 - 60
  • [34] Estimating LAD regression coefficients with best subset points
    Choi, Hyun Jip
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2008, 37 (09) : 1799 - 1809
  • [35] Conditional Uncorrelation and Efficient Subset Selection in Sparse Regression
    Wang, Jianji
    Zhang, Shupei
    Liu, Qi
    Du, Shaoyi
    Guo, Yu-Cheng
    Zheng, Nanning
    Wang, Fei-Yue
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (10) : 10458 - 10467
  • [36] INFLATION OF R2 IN BEST SUBSET REGRESSION
    RENCHER, AC
    PUN, FC
    TECHNOMETRICS, 1980, 22 (01) : 49 - 53
  • [37] Complete subset least squares support vector regression
    Qiu, Yue
    ECONOMICS LETTERS, 2021, 200
  • [38] Selection of the best calibration sample subset for multivariate regression
    Ferre, J
    Rius, FX
    ANALYTICAL CHEMISTRY, 1996, 68 (09) : 1565 - 1571
  • [39] Subset selection for multiple linear regression via optimization
    Park, Young Woong
    Klabjan, Diego
    JOURNAL OF GLOBAL OPTIMIZATION, 2020, 77 (03) : 543 - 574
  • [40] Ordered-Subset Ridge Regression in Image Reconstruction
    Pan, Jinxiao
    Han, Yan
    Liu, Liping
    Zhou, Tie
    PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOLS 1-9, 2009, : 750 - +