Variable selection in high-dimensional regression: a nonparametric procedure for business failure prediction

被引:12
|
作者
Amendola, Alessandra [1 ]
Giordano, Francesco [1 ]
Parrella, Maria Lucia [1 ]
Restaino, Marialuisa [1 ]
机构
[1] Univ Salerno, Dept Econ & Stat, Via Giovanni Paolo II 132, Salerno, Italy
关键词
variable selection; business failure prediction; out-of-sample and out-of-time evaluation; BANKRUPTCY PREDICTION; FINANCIAL RATIOS; DISCRIMINANT-ANALYSIS; DEFAULT RISK; PROBABILITY; ACCURACY; DISTRESS; MODEL;
D O I
10.1002/asmb.2240
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Business failure prediction models are important in providing warning for preventing financial distress and giving stakeholders time to react in a timely manner to a crisis. The empirical approach to corporate distress analysis and forecasting has recently attracted new attention from financial institutions, academics, and practitioners. In fact, this field is as interesting today as it was in the 1930s, and over the last 80 years, a remarkable body of both theoretical and empirical studies on this topic has been published. Nevertheless, some issues are still under investigation, such as the selection of financial ratios to define business failure and the identification of an optimal subset of predictors. For this purpose, there exist a large number of methods that can be used, although their drawbacks are usually neglected in this context. Moreover, most variable selection procedures are based on some very strict assumptions (linearity and additivity) that make their application difficult in business failure prediction. This paper proposes to overcome these limits by selecting relevant variables using a nonparametric method named Rodeo that is consistent even when the aforementioned assumptions are not satisfied. We also compare Rodeo with two other variable selection methods (Lasso and Adaptive Lasso), and the empirical results demonstrate that our proposed procedure outperforms the others in terms of positive/negative predictive value and is able to capture the nonlinear effects of the selected variables. Copyright (c) 2017 John Wiley & Sons, Ltd.
引用
收藏
页码:355 / 368
页数:14
相关论文
共 50 条
  • [41] Nonparametric Additive Regression for High-Dimensional Group Testing Data
    Zuo, Xinlei
    Ding, Juan
    Zhang, Junjian
    Xiong, Wenjun
    [J]. MATHEMATICS, 2024, 12 (05)
  • [42] Empirical Study on High-Dimensional Variable Selection and Prediction Under Competing Risks
    Hou, Jiayi
    Xu, Ronghui
    [J]. NEW FRONTIERS OF BIOSTATISTICS AND BIOINFORMATICS, 2018, : 421 - 440
  • [43] Variable selection and estimation in high-dimensional models
    Horowitz, Joel L.
    [J]. CANADIAN JOURNAL OF ECONOMICS-REVUE CANADIENNE D ECONOMIQUE, 2015, 48 (02): : 389 - 407
  • [44] Variable selection for high-dimensional incomplete data
    Liang, Lixing
    Zhuang, Yipeng
    Yu, Philip L. H.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2024, 192
  • [45] Variable selection in high-dimensional partially linear additive models for composite quantile regression
    Guo, Jie
    Tang, Manlai
    Tian, Maozai
    Zhu, Kai
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2013, 65 : 56 - 67
  • [46] High-dimensional graphs and variable selection with the Lasso
    Meinshausen, Nicolai
    Buehlmann, Peter
    [J]. ANNALS OF STATISTICS, 2006, 34 (03): : 1436 - 1462
  • [47] SCAD-penalized quantile regression for high-dimensional data analysis and variable selection
    Amin, Muhammad
    Song, Lixin
    Thorlie, Milton Abdul
    Wang, Xiaoguang
    [J]. STATISTICA NEERLANDICA, 2015, 69 (03) : 212 - 235
  • [48] Sparse Bayesian variable selection in high-dimensional logistic regression models with correlated priors
    Ma, Zhuanzhuan
    Han, Zifei
    Ghosh, Souparno
    Wu, Liucang
    Wang, Min
    [J]. STATISTICAL ANALYSIS AND DATA MINING, 2024, 17 (01)
  • [49] SPATIAL BAYESIAN VARIABLE SELECTION AND GROUPING FOR HIGH-DIMENSIONAL SCALAR-ON-IMAGE REGRESSION
    Li, Fan
    Zhang, Tingting
    Wang, Quanli
    Gonzalez, Marlen Z.
    Maresh, Erin L.
    Coan, James A.
    [J]. ANNALS OF APPLIED STATISTICS, 2015, 9 (02): : 687 - 713
  • [50] Variable selection in the single-index quantile regression model with high-dimensional covariates
    Kuruwita, C. N.
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2023, 52 (03) : 1120 - 1132