Variable selection in linear regression

被引:80
|
作者
Lindsey, Charles [1 ]
Sheather, Simon [2 ]
机构
[1] StataCorp, College Stn, TX USA
[2] Texas A&M Univ, Dept Stat, College Stn, TX 77843 USA
来源
STATA JOURNAL | 2010年 / 10卷 / 04期
关键词
st0213; vselect; variable selection; regress; nestreg; MODEL SELECTION;
D O I
10.1177/1536867X1101000407
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
We present a new Stata program, vselect, that helps users perform variable selection after performing a linear regression. Options for stepwise methods such as forward selection and backward elimination are provided. The user may specify Mallows's C-p, Akaike's information criterion, Akaike's corrected information criterion, Bayesian information criterion, or R-2 adjusted as the information criterion for the selection. When the user specifies the best subset option, the leaps-and-bounds algorithm (Furnival and Wilson, Technometrics 16: 499-511) is used to determine the best subsets of each predictor size. All the previously mentioned information criteria are reported for each of these subsets. We also provide options for doing variable selection only on certain predictors (as in [R] nestreg) and support for weighted linear regression. All options are demonstrated on real datasets with varying numbers of predictors.
引用
收藏
页码:650 / 669
页数:20
相关论文
共 50 条
  • [1] On variable selection in linear regression
    Kabaila, P
    [J]. ECONOMETRIC THEORY, 2002, 18 (04) : 913 - 925
  • [2] Variable Selection in Linear Regression With Many Predictors
    Cai, Airong
    Tsay, Ruey S.
    Chen, Rong
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2009, 18 (03) : 573 - 591
  • [3] Variable selection and transformation in linear regression models
    Yeo, IK
    [J]. STATISTICS & PROBABILITY LETTERS, 2005, 72 (03) : 219 - 226
  • [4] Variable selection in functional linear concurrent regression
    Ghosal, Rahul
    Maity, Arnab
    Clark, Timothy
    Longo, Stefano B.
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2020, 69 (03) : 565 - 587
  • [5] RESPONSE VARIABLE SELECTION IN MULTIVARIATE LINEAR REGRESSION
    Khare, Kshitij
    Su, Zhihua
    [J]. STATISTICA SINICA, 2024, 34 (03) : 1325 - 1345
  • [6] Variable Selection in Multivariate Functional Linear Regression
    Yeh, Chi-Kuang
    Sang, Peijun
    [J]. STATISTICS IN BIOSCIENCES, 2023,
  • [7] ROBUST CRITERION FOR VARIABLE SELECTION IN LINEAR REGRESSION
    Patil, A. B.
    Kashid, D. N.
    [J]. INTERNATIONAL JOURNAL OF AGRICULTURAL AND STATISTICAL SCIENCES, 2009, 5 (02): : 509 - 521
  • [8] Bayesian variable and transformation selection in linear regression
    Hoeting, JA
    Raftery, AE
    Madigan, D
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2002, 11 (03) : 485 - 507
  • [9] Variable selection in partial linear regression using the least angle regression
    Seo, Han Son
    Yoon, Min
    Lee, Hakbae
    [J]. KOREAN JOURNAL OF APPLIED STATISTICS, 2021, 34 (06) : 937 - 944
  • [10] Variable selection in multivariate linear regression with random predictors
    Mbina, Alban Mbina
    Nkiet, Guy Martial
    N'guessan, Assi
    [J]. SOUTH AFRICAN STATISTICAL JOURNAL, 2023, 57 (01) : 27 - 44