Impact of Feature Selection Methods on the Performance of Credit Risk Classification Algorithms

Authors
Singh, N. P. [1]
Singh, Devender [2]
Affiliations
[1] Management Dev Inst, Informat Management Area, Gurgaon, Gurugram, India
[2] AIMA AMU Phd Program, Aligarh, Uttar Pradesh, India
Keywords
Feature Selection; Chi-Square; Gain Ratio; Information Gain; Relief F; Symmetric Uncertainty
DOI
10.1109/aict47866.2019.8981771
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
This paper presents an ensemble of filter feature selection algorithms for a classification problem: assessing credit risk for a financial institution. Feature selection is one of the most important aspects of data mining and machine learning. Its main objectives are to reduce computing resources, reduce future data collection costs, reduce model complexity, avoid overfitting, and improve the performance of machine learning algorithms. In this paper, the set of available variables is first reduced using filter feature selection methods: chi-square, gain ratio, information gain, Relief F, and symmetric uncertainty. In addition, ensemble feature selection of the input variables based on these individual methods is also used. The impact of feature selection is measured by fitting seven classification algorithms: Random Forest, C4.5, PART, C5.0, Bagging, Boosting, and MINI Linear. The performance of the models is compared using accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and AUC. The data used are the German bank data set of 1000 records, with 20 features and one target variable.
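As a rough illustration of the filter-then-ensemble idea described in the abstract, the sketch below scores categorical features with three of the named filters (information gain, gain ratio, symmetric uncertainty) and combines them by mean rank. The abstract does not say how the ensemble is formed, so mean-rank aggregation is an assumption here, and the toy features (history, housing, noise) are invented for illustration and are not the German bank data.

```python
from collections import Counter
from math import log2

def entropy(values):
    """Shannon entropy (bits) of a list of categorical values."""
    n = len(values)
    return -sum((c / n) * log2(c / n) for c in Counter(values).values())

def conditional_entropy(feature, target):
    """H(target | feature): entropy of the target within each feature value."""
    n = len(feature)
    groups = {}
    for f, t in zip(feature, target):
        groups.setdefault(f, []).append(t)
    return sum(len(g) / n * entropy(g) for g in groups.values())

def information_gain(feature, target):
    return entropy(target) - conditional_entropy(feature, target)

def gain_ratio(feature, target):
    split_info = entropy(feature)
    return information_gain(feature, target) / split_info if split_info > 0 else 0.0

def symmetric_uncertainty(feature, target):
    denom = entropy(feature) + entropy(target)
    return 2 * information_gain(feature, target) / denom if denom > 0 else 0.0

def rank(scores):
    """Map feature name -> rank (1 = best) for one scoring method."""
    ordered = sorted(scores, key=scores.get, reverse=True)
    return {name: i + 1 for i, name in enumerate(ordered)}

def ensemble_ranking(features, target, scorers):
    """Assumed aggregation: order features by their mean rank across scorers."""
    ranks = [rank({name: s(col, target) for name, col in features.items()})
             for s in scorers]
    mean_rank = {name: sum(r[name] for r in ranks) / len(ranks) for name in features}
    return sorted(features, key=mean_rank.get)

# Hypothetical toy data: 8 applicants, binary credit outcome.
target = ["good", "good", "bad", "bad", "good", "bad", "good", "bad"]
features = {
    "history": ["ok", "ok", "late", "late", "ok", "late", "ok", "late"],
    "housing": ["own", "own", "rent", "own", "own", "rent", "rent", "rent"],
    "noise":   ["a", "b", "a", "b", "b", "a", "a", "b"],
}
scorers = [information_gain, gain_ratio, symmetric_uncertainty]
ranking = ensemble_ranking(features, target, scorers)
print(ranking)
```

In a workflow like the one the abstract outlines, the top-ranked features from such a list would then be passed to each classifier and the resulting accuracy, sensitivity, specificity, and AUC compared against the full feature set.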
Pages: 101-106
Number of pages: 6