An efficient method for feature selection in linear regression based on an extended Akaike's information criterion

被引:0
|
作者
Vetrov, D. P. [1 ]
Kropotov, D. A. [2 ]
Ptashko, N. O. [1 ]
机构
[1] Moscow MV Lomonosov State Univ, Fac Computat Math & Cybernet, Moscow 119992, Russia
[2] Russian Acad Sci, Dorodnicyn Comp Ctr, Moscow 119333, Russia
基金
俄罗斯基础研究基金会;
关键词
pattern recognition; linear regression; feature selection; Akaike's information criterion;
D O I
10.1134/S096554250911013X
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
A method for feature selection in linear regression based on an extension of Akaike's information criterion is proposed. The use of classical Akaike's information criterion (AIC) for feature selection assumes the exhaustive search through all the subsets of features, which has unreasonably high computational and time cost. A new information criterion is proposed that is a continuous extension of AIC. As a result, the feature selection problem is reduced to a smooth optimization problem. An efficient procedure for solving this problem is derived. Experiments show that the proposed method enables one to efficiently select features in linear regression. In the experiments, the proposed procedure is compared with the relevance vector machine, which is a feature selection method based on Bayesian approach. It is shown that both procedures yield similar results. The main distinction of the proposed method is that certain regularization coefficients are identical zeros. This makes it possible to avoid the underfitting effect, which is a characteristic feature of the relevance vector machine. A special case (the so-called nondiagonal regularization) is considered in which both methods are identical.
引用
下载
收藏
页码:1972 / 1985
页数:14
相关论文
共 50 条
  • [21] Asymptotic post-selection inference for the Akaike information criterion
    Charkhi, Ali
    Claeskens, Gerda
    BIOMETRIKA, 2018, 105 (03) : 645 - 664
  • [23] Akaike Information Criterion for Selecting Variables in the Nested Error Regression Model
    Kubokawa, Tatsuya
    Srivastava, Muni S.
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2012, 41 (15) : 2626 - 2642
  • [24] Conditional Akaike information criterion for generalized linear mixed models
    Yu, Dalei
    Yau, Kelvin K. W.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2012, 56 (03) : 629 - 644
  • [25] Piecewise Regression through the Akaike Information Criterion using Mathematical Programming
    Gkioulekas, Ioannis
    Papageorgiou, Lazaros G.
    IFAC PAPERSONLINE, 2018, 51 (15): : 730 - 735
  • [26] Akaike's information criterion in generalized estimating equations
    Pan, W
    BIOMETRICS, 2001, 57 (01) : 120 - 125
  • [27] Akaike's information criterion for a measure of linkage disequilibrium
    K. Shimo-onoda
    T. Tanaka
    K. Furushima
    T. Nakajima
    S. Toh
    S. Harata
    K. Yone
    S. Komiya
    H. Adachi
    E. Nakamura
    H. Fujimiya
    I. Inoue
    Journal of Human Genetics, 2002, 47 : 649 - 655
  • [28] Akaike's information criterion for a measure of linkage disequilibrium
    Shimo-onoda, K
    Tanaka, T
    Furushima, K
    Nakajima, T
    Toh, S
    Harata, S
    Yone, K
    Komiya, S
    Adachi, H
    Nakamura, E
    Fujimiya, H
    Inoue, I
    JOURNAL OF HUMAN GENETICS, 2002, 47 (12) : 649 - 655
  • [29] Finite Sample Improvement of Akaike's Information Criterion
    Saumard, Adrien
    Navarro, Fabien
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2021, 67 (10) : 6328 - 6343
  • [30] Estimation of time -varying linear regression with unknown time -volatility via continuous generalization of the Akaike Information Criterion
    Moscow Institute of Physics and Technology, Department of Intelligent Systems, Moscow, Russia
    不详
    不详
    World Acad. Sci. Eng. Technol., 2009, (151-156):