Integration of unsupervised and supervised machine learning algorithms for credit risk assessment

被引:71
|
作者
Wang Bao [1 ]
Ning Lianju [1 ]
Kong Yue [2 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Econ & Management, POB 164 10,Xitucheng Rd, Beijing 100876, Peoples R China
[2] Beijing Univ Chem Technol, Dept Pharmaceut Engn, State Key Lab Chem Resource Engn, POB 53,15 Beisanhuan East Rd, Beijing 100029, Peoples R China
基金
中国国家自然科学基金;
关键词
Credit scoring; Ensemble model; Unsupervised machine learning; Supervised machine learning; Kohonen's self-organizing maps (SOM); SCORING MODEL; PREDICTION; CLASSIFICATION; ACCURACY;
D O I
10.1016/j.eswa.2019.02.033
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For the sake of credit risk assessment, credit scoring has become a critical tool to discriminate "bad" applicants from "good" applicants for financial institutions. Accordingly, a wide range of supervised machine learning algorithms have been successfully applied to credit scoring: however, integration of unsupervised learning with supervised learning in this field has drawn little consideration. In this work, we propose a combination strategy of integrating unsupervised learning with supervised learning for credit risk assessment. The difference between our work and other previous work on unsupervised integration is that we apply unsupervised learning techniques at two different stages: the consensus stage and dataset clustering stage. Comparisons of model performance are performed based on three credit datasets in four groups: individual models, individual models+ consensus model, clustering+ individual models, clustering + individual models+ consensus model. As a result, integration at either the consensus stage or dataset clustering stage is effective on improving the performance of credit scoring models. Moreover, the combination of the two stages achieves the best performance, thereby confirming the superiority of the proposed integration of unsupervised and supervised machine learning algorithms, which boost our confidence that this strategy can be extended to many other credit datasets from financial institutions. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页码:301 / 315
页数:15
相关论文
共 50 条
  • [1] Credit Risk Assessment Using Machine Learning Algorithms
    Attigeri, Girija V.
    Pai, M. M. Manohara
    Pai, Radhika M.
    [J]. ADVANCED SCIENCE LETTERS, 2017, 23 (04) : 3649 - 3653
  • [2] CREDIT RISK EVALUATION BASED ON SUPERVISED LEARNING ALGORITHMS
    Novakovic, Jasmina
    Veljovic, Alempije
    [J]. METALURGIA INTERNATIONAL, 2012, 17 (05): : 195 - 203
  • [3] Combining supervised and unsupervised machine learning algorithms to predict the learners' learning styles
    El Aissaoui, Ouafae
    El Alami El Madani, Yasser
    Oughdir, Lahcen
    El Allioui, Youssouf
    [J]. SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING IN DATA SCIENCES (ICDS2018), 2019, 148 : 87 - 96
  • [4] Supervised Machine Learning Algorithms for Credit Card Fraud Detection: A Comparison
    Khatri, Samidha
    Arora, Aishwarya
    Agrawal, Arun Prakash
    [J]. PROCEEDINGS OF THE CONFLUENCE 2020: 10TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING, 2020, : 680 - 683
  • [5] Credit Risk Analysis Using Machine Learning Algorithms
    Kalayci, Sacide
    Kamasak, Mustafa
    Arslan, Secil
    [J]. 2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [6] Predicting of Credit Risk Using Machine Learning Algorithms
    Antony, Tisa Maria
    Kumar, B. Sathish
    [J]. ARTIFICIAL INTELLIGENCE: THEORY AND APPLICATIONS, VOL 1, AITA 2023, 2024, 843 : 99 - 114
  • [7] Credit Risk Analysis Using Machine-Learning Algorithms
    Alagoz, Gokhan
    Canakoglu, Ethem
    [J]. 29TH IEEE CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS (SIU 2021), 2021,
  • [8] On integrating unsupervised and supervised classification for credit risk evaluation
    Zakrzewska, Danuta
    [J]. INFORMATION TECHNOLOGY AND CONTROL, 2007, 36 (1A): : 98 - 102
  • [9] ANALYSING MACHINE LEARNING METHODS AND CREDIT RISK ASSESSMENT
    Coelho, Felipe Fernandes
    de Lima Amorim, Daniel Penido
    de Camargos, Marcos Antonio
    [J]. REVISTA GESTAO & TECNOLOGIA-JOURNAL OF MANAGEMENT AND TECHNOLOGY, 2021, 21 (01): : 89 - 116
  • [10] Ensembling Supervised and Unsupervised Machine Learning Algorithms for Detecting Distributed Denial of Service Attacks
    Das, Saikat
    Ashrafuzzaman, Mohammad
    Sheldon, Frederick T.
    Shiva, Sajjan
    [J]. ALGORITHMS, 2024, 17 (03)