Impact of Feature Selection Methods on the Perfromance of Credit Risk Classification Algorithms

被引:0
|
作者
Singh, N. P. [1 ]
Singh, Devender [2 ]
机构
[1] Management Dev Inst, Informat Management Area, Gurgaon, Gurugram, India
[2] AIMA AMU Phd Program, Aligarh, Uttar Pradesh, India
关键词
Feature Selection; Chi-Square; Gain Ratio; Information Gain; Relief F; symmetric uncertainty;
D O I
10.1109/aict47866.2019.8981771
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents ensembling of filter features selection algorithms for classification problem in the context of assessment of risk of credit for a financial institution. Feature selection is one of the most important aspect of data mining, and machine learning algorithm. The main objective of feature selection is reduction in computing resources, reduction in future data collection cost, reducing complexities of the model, avoiding overfitting, and increasing the performance of machine learning algorithms. In this paper the set of available variables are firstly reduced using filter feature selection methods such as chi-square, gain ratio, information gain, relief F, and symmetric uncertainly. In addition, ensemble feature selection of the input variables based on these individual methods is also used. The impact of feature selection is measured by fitting seven classification algorithms, i.e., Random Forest, C4.5, PART, C5.0, Bagging, Boosting, and MINI Linear. The performance of the models is compared by calculating parameters such as accuracy, sensitivity, specificity, positive predictive values, negatively predictive values, and AUC. The data used is German bank data of 1000 records and 20 features and one target variable
引用
收藏
页码:101 / 106
页数:6
相关论文
共 50 条
  • [1] Credit Risk Assessment Using Learning Algorithms for Feature Selection
    Hassani, Zeinab
    Meybodi, Mohsen Alambardar
    Hajihashemi, Vahid
    [J]. FUZZY INFORMATION AND ENGINEERING, 2020, 12 (04) : 529 - 544
  • [2] Hybridisation of Feature Selection and Classification Techniques in Credit Risk Assessment Modelling
    Sakri, Sapiah
    Othman, Jaizah
    Halid, Noreha
    [J]. KNOWLEDGE INNOVATION THROUGH INTELLIGENT SOFTWARE METHODOLOGIES, TOOLS AND TECHNIQUES (SOMET_20), 2020, 327 : 367 - 380
  • [3] Hybridisation of feature selection and classification techniques in credit risk assessment modelling
    Sakri, Sapiah
    Othman, Jaizah
    Halid, Noreha
    [J]. Frontiers in Artificial Intelligence and Applications, 2020, 327 : 367 - 380
  • [4] Impact of feature selection methods on data classification for IDS
    Jiang, Shuai
    Xu, Xiaolong
    [J]. 2019 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY (CYBERC), 2019, : 174 - 180
  • [5] An Analytic Hierarchy Model for Classification Algorithms Selection in Credit Risk Analysis
    Kou, Gang
    Wu, Wenshuai
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2014, 2014
  • [6] A Comprehensive Study of Eleven Feature Selection Algorithms and their Impact on Text Classification
    Vora, Suchi
    Yang, Hui
    [J]. 2017 COMPUTING CONFERENCE, 2017, : 440 - 449
  • [7] Fault Detection in Microgrids Using Combined Classification Algorithms and Feature Selection Methods
    Ranjbar, S.
    Jamali, S.
    [J]. 2019 INTERNATIONAL CONFERENCE ON PROTECTION AND AUTOMATION OF POWER SYSTEM (IPAPS), 2019, : 17 - 21
  • [8] A VNS-EDA Algorithm-Based Feature Selection for Credit Risk Classification
    Chen, Wei
    Li, Zhongfei
    Guo, Jinchao
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [9] Comparison of classification algorithms using feature selection
    Juarez-Lopez, Alexander
    Hernandez-Torruco, Jose
    Hernandez-Ocana, Betania
    Chavez-Bosquez, Oscar
    [J]. 2021 MEXICAN INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE (ENC 2021), 2021,
  • [10] Genetic algorithms for clustering, feature selection and classification
    Tseng, LY
    Yang, SB
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, 1997, : 1612 - 1616