Feature selection based on SVM for credit scoring

被引:4
|
作者
Yao, Ping [1 ]
机构
[1] Heilongjiang Inst Sci & Technol, Sch Econ & Management, Harbin 150027, Peoples R China
关键词
credit scoring; feature selection; support vector machine; t-test; principle component analysis; factor analysis; stepwise regression; rough set; classification and regression tree; multivariate adaptive regression splines;
D O I
10.1109/CINC.2009.36
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As the credit industry has been growing rapidly, huge number of consumers' credit data are collected by the credit department of the bank and credit scoring has become a very important issue. Usually, a large amount of redundant information and features are involved in the credit dataset, which leads to lower accuracy and higher complexity of the credit scoring model, so, effective feature selection methods are necessary for credit dataset with huge number of features. This paper aims at comparing seven well-known feature selection methods for credit scoring. Which are t-test, principle component analysis (PCA), factor analysis (FA), stepwise regression, Rough Set (RS), Classification and regression tree (CART) and Multivariate adaptive regression splines (MARS). Support vector machine (SVM) is used as the classification model. Two credit scoring databases are used in order to provide a reliable conclusion. Regarding the experimental results, the CART and MARS methods outperform the other methods by the overall accuracy and type I error and type II error.
引用
收藏
页码:44 / 47
页数:4
相关论文
共 50 条
  • [1] Combination of feature selection approaches with SVM in credit scoring
    Chen, Fei-Long
    Li, Feng-Chia
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (07) : 4902 - 4909
  • [2] Integrated framework for profit-based feature selection and SVM classification in credit scoring
    Maldonado, Sebastian
    Bravo, Cristian
    Lopez, Julio
    Perez, Juan
    [J]. DECISION SUPPORT SYSTEMS, 2017, 104 : 113 - 121
  • [3] SVM-based Credit Rating and Feature Selection
    Qin, Yu-qiang
    Qi, Yu-dong
    Ying, Hui
    [J]. MATERIALS, MACHINES AND DEVELOPMENT OF TECHNOLOGIES FOR INDUSTRIAL PRODUCTION, 2014, 618 : 573 - +
  • [4] Feature Selection in a Credit Scoring Model
    Laborda, Juan
    Ryoo, Seyong
    [J]. MATHEMATICS, 2021, 9 (07)
  • [5] Credit scoring with a feature selection approach based deep learning
    Van-Sang Ha
    Ha-Nam Nguyen
    [J]. 2016 7TH INTERNATIONAL CONFERENCE ON MECHANICAL, INDUSTRIAL, AND MANUFACTURING TECHNOLOGIES (MIMT 2016), 2016, 54
  • [6] BAT algorithm based feature selection: Application in credit scoring
    Tripathi, Diwakar
    Reddy, B. Ramachandra
    Reddy, Y. C. A. Padmanabha
    Shukla, Alok Kumar
    Kumar, Ravi Kant
    Sharma, Neeraj Kumar
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (05) : 5561 - 5570
  • [7] BAT algorithm based feature selection: Application in credit scoring
    Tripathi, Diwakar
    Ramachandra Reddy, B.
    Padmanabha Reddy, Y.C.A.
    Shukla, Alok Kumar
    Kumar, Ravi Kant
    Sharma, Neeraj Kumar
    [J]. Journal of Intelligent and Fuzzy Systems, 2021, 41 (05): : 5561 - 5570
  • [8] Rough set and Tabu search based feature selection for credit scoring
    Wang, Jue
    Guo, Kun
    Wang, Shouyang
    [J]. ICCS 2010 - INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, PROCEEDINGS, 2010, 1 (01): : 2419 - 2426
  • [9] Profit-based credit scoring based on robust optimization and feature selection
    Lopez, Julio
    Maldonado, Sebastian
    [J]. INFORMATION SCIENCES, 2019, 500 : 190 - 202
  • [10] Fusion of feature selection methods for pairwise scoring SVM
    Mak, Man-Wai
    Kung, Sun-Yuan
    [J]. NEUROCOMPUTING, 2008, 71 (16-18) : 3104 - 3113