New hybrid data mining model for credit scoring based on feature selection algorithm and ensemble classifiers

被引:38
|
作者
Nalic, Jasmina [1 ]
Martinovic, Goran [1 ]
Zagar, Drago [1 ]
机构
[1] JJ Strossmayer Univ Osijek, Fac Elect Engn Comp Sci & Informat Technol Osijek, KnezaTrpimira 2b, Osijek 31000, Croatia
关键词
Credit scoring; Data mining; Ensemble classifier; Feature selection; Hybrid model;
D O I
10.1016/j.aei.2020.101130
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The aim of this paper is to propose a new hybrid data mining model based on combination of various feature selection and ensemble learning classification algorithms, in order to support decision making process. The model is built through several stages. In the first stage, initial dataset is preprocessed and apart of applying different preprocessing techniques, we paid a great attention to the feature selection. Five different feature selection algorithms were applied and their results, based on ROC and accuracy measures of logistic regression algorithm, were combined based on different voting types. We also proposed a new voting method, called if-any, that outperformed all other voting methods, as well as a single feature selection algorithm's results. In the next stage, a four different classification algorithms, including generalized linear model, support vector machine, naive Bayes and decision tree, were performed based on dataset obtained in the feature selection process. These classifiers were combined in eight different ensemble models using soft voting method. Using the real dataset, the experimental results show that hybrid model that is based on features selected by if-any voting method and ensemble GLM + DT model performs the highest performance and outperforms all other ensemble and single classifier models.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] A hybrid data mining model of feature selection algorithms and ensemble learning classifiers for credit scoring
    Koutanaei, Fatemeh Nemati
    Sajedi, Hedieh
    Khanbabaei, Mohammad
    JOURNAL OF RETAILING AND CONSUMER SERVICES, 2015, 27 : 11 - 23
  • [2] A novel hybrid credit scoring model based on ensemble feature selection and multilayer ensemble classification
    Tripathi, Diwakar
    Edla, Damodar Reddy
    Cheruku, Ramalingaswamy
    Kuppili, Venkatanareshbabu
    COMPUTATIONAL INTELLIGENCE, 2019, 35 (02) : 371 - 394
  • [3] A new hybrid ensemble credit scoring model based on classifiers consensus system approach
    Ala'raj, Maher
    Abbod, Maysam F.
    EXPERT SYSTEMS WITH APPLICATIONS, 2016, 64 : 36 - 55
  • [4] Data mining feature selection for credit scoring models
    Liu, Y
    Schumann, M
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2005, 56 (09) : 1099 - 1108
  • [5] A new hybrid credit scoring ensemble model with feature enhancement and soft voting weight optimization
    Yang, Dongqi
    Xiao, Binqing
    Cao, Mengya
    Shen, Huaqi
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [6] Feature Selection in a Credit Scoring Model
    Laborda, Juan
    Ryoo, Seyong
    MATHEMATICS, 2021, 9 (07)
  • [7] BAT algorithm based feature selection: Application in credit scoring
    Tripathi, Diwakar
    Reddy, B. Ramachandra
    Reddy, Y. C. A. Padmanabha
    Shukla, Alok Kumar
    Kumar, Ravi Kant
    Sharma, Neeraj Kumar
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (05) : 5561 - 5570
  • [8] BAT algorithm based feature selection: Application in credit scoring
    Tripathi, Diwakar
    Ramachandra Reddy, B.
    Padmanabha Reddy, Y.C.A.
    Shukla, Alok Kumar
    Kumar, Ravi Kant
    Sharma, Neeraj Kumar
    Journal of Intelligent and Fuzzy Systems, 2021, 41 (05): : 5561 - 5570
  • [9] A Credit Scoring Model Based on Integrated Mixed Sampling and Ensemble Feature Selection: RBR XGB _
    Lin, Xiaobing
    Wu, Zhe
    Chen, Jianfa
    Huang, Lianfen
    Shi, Zhiyuan
    JOURNAL OF INTERNET TECHNOLOGY, 2022, 23 (05): : 1061 - 1068
  • [10] Improving Feature Selection for Credit Scoring Classification Using a Novel Hybrid Algorithm
    Qasim, Omar Saber
    Algamal, Zakariya Yahya
    THAILAND STATISTICIAN, 2021, 19 (03): : 593 - 605