Feature Selection in a Credit Scoring Model

被引:19
|
作者
Laborda, Juan [1 ]
Ryoo, Seyong [2 ]
机构
[1] Univ Carlos III, Dept Business Adm, Madrid 28903, Spain
[2] Katholieke Univ Leuven, Leuven Stat Res Ctr, B-3000 Leuven, Belgium
关键词
operational research in banking; machine learning; credit scoring; classification algorithms; feature selection methods; SUPPORT VECTOR MACHINES; ART CLASSIFICATION ALGORITHMS; LEARNING-METHODS; PREDICTION; BANKRUPTCY; REGRESSION; PROBABILITY; ACCURACY;
D O I
10.3390/math9070746
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
This paper proposes different classification algorithms-logistic regression, support vector machine, K-nearest neighbors, and random forest-in order to identify which candidates are likely to default for a credit scoring model. Three different feature selection methods are used in order to mitigate the overfitting in the curse of dimensionality of these classification algorithms: one filter method (Chi-squared test and correlation coefficients) and two wrapper methods (forward stepwise selection and backward stepwise selection). The performances of these three methods are discussed using two measures, the mean absolute error and the number of selected features. The methodology is applied for a valuable database of Taiwan. The results suggest that forward stepwise selection yields superior performance in each one of the classification algorithms used. The conclusions obtained are related to those in the literature, and their managerial implications are analyzed.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] An Effective Credit Scoring Model Implementation by Optimal Feature Selection Scheme
    Lenka, Sudhansu R.
    Bisoy, Sukant Kishoro
    Priyadarshini, Rojalina
    Hota, Jhalak
    Barik, Rabindra K.
    [J]. 2021 INTERNATIONAL CONFERENCE ON EMERGING SMART COMPUTING AND INFORMATICS (ESCI), 2021, : 106 - 109
  • [2] Feature selection based on SVM for credit scoring
    Yao, Ping
    [J]. PROCEEDINGS OF THE 2009 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND NATURAL COMPUTING, VOL II, 2009, : 44 - 47
  • [3] A local binary social spider algorithm for feature selection in credit scoring model
    Zhang, Zaimei
    Li, Yitan
    Liu, Yan
    Liu, Siming
    [J]. APPLIED SOFT COMPUTING, 2023, 144
  • [4] Business-Oriented Feature Selection for Hybrid Classification Model of Credit Scoring
    Chornous, Galyna
    Nikolskyi, Ihor
    [J]. 2018 IEEE SECOND INTERNATIONAL CONFERENCE ON DATA STREAM MINING & PROCESSING (DSMP), 2018, : 397 - 401
  • [5] Data mining feature selection for credit scoring models
    Liu, Y
    Schumann, M
    [J]. JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2005, 56 (09) : 1099 - 1108
  • [6] Combination of feature selection approaches with SVM in credit scoring
    Chen, Fei-Long
    Li, Feng-Chia
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (07) : 4902 - 4909
  • [7] A hybrid model with novel feature selection method and enhanced voting method for credit scoring
    Yao, Jianrong
    Wang, Zhongyi
    Wang, Lu
    Zhang, Zhebin
    Jiang, Hui
    Yan, Surong
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (03) : 2565 - 2579
  • [8] An application of locally linear model tree algorithm with combination of feature selection in credit scoring
    Siami, Mohammad
    Gholamian, Mohammad Reza
    Basiri, Javad
    [J]. INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2014, 45 (10) : 2213 - 2222
  • [9] Application of the VNS heuristic for feature selection in credit scoring problems
    Helder, Victor Gomes
    Filomena, Tiago Pascoal
    Ferreira, Luciano
    Kirch, Guilherme
    [J]. MACHINE LEARNING WITH APPLICATIONS, 2022, 9
  • [10] On the Effect of Search Strategies On Wrapper Feature Selection in Credit Scoring
    Waad, Bouaguel
    Ghazi, Bel Mufti
    Mohamed, Limam
    [J]. 2013 INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT), 2013, : 218 - 223