Integration of unsupervised and supervised machine learning algorithms for credit risk assessment

Cited by: 72
Authors
Wang Bao [1]
Ning Lianju [1]
Kong Yue [2]
Affiliations
[1] Beijing Univ Posts & Telecommun, Sch Econ & Management, POB 164 10,Xitucheng Rd, Beijing 100876, Peoples R China
[2] Beijing Univ Chem Technol, Dept Pharmaceut Engn, State Key Lab Chem Resource Engn, POB 53,15 Beisanhuan East Rd, Beijing 100029, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Credit scoring; Ensemble model; Unsupervised machine learning; Supervised machine learning; Kohonen's self-organizing maps (SOM); SCORING MODEL; PREDICTION; CLASSIFICATION; ACCURACY;
DOI
10.1016/j.eswa.2019.02.033
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Credit scoring has become a critical tool for financial institutions to discriminate "bad" applicants from "good" applicants in credit risk assessment. A wide range of supervised machine learning algorithms have been successfully applied to credit scoring; however, the integration of unsupervised learning with supervised learning in this field has drawn little attention. In this work, we propose a combination strategy that integrates unsupervised learning with supervised learning for credit risk assessment. Our work differs from previous work on unsupervised integration in that we apply unsupervised learning techniques at two different stages: the consensus stage and the dataset clustering stage. Model performance is compared on three credit datasets in four groups: individual models, individual models + consensus model, clustering + individual models, and clustering + individual models + consensus model. The results show that integration at either the consensus stage or the dataset clustering stage is effective in improving the performance of credit scoring models. Moreover, the combination of the two stages achieves the best performance, confirming the superiority of the proposed integration of unsupervised and supervised machine learning algorithms and boosting our confidence that this strategy can be extended to many other credit datasets from financial institutions. © 2019 Elsevier Ltd. All rights reserved.
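The two-stage idea in the abstract (cluster the dataset first, train individual supervised models per cluster, then combine their outputs) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and not the authors' implementation: plain k-means stands in for the Kohonen self-organizing map, a simple majority vote stands in for the paper's consensus model, the base learners (nearest centroid, 1-NN, 3-NN) are toy choices, the data are synthetic, and all function names are hypothetical.

```python
import random
from collections import Counter

def dist2(a, b):
    """Squared Euclidean distance between two feature tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def mean(rows):
    """Component-wise mean of a non-empty list of feature tuples."""
    return tuple(sum(col) / len(rows) for col in zip(*rows))

def nearest(p, centers):
    """Index of the center closest to point p."""
    return min(range(len(centers)), key=lambda i: dist2(p, centers[i]))

def kmeans(points, k, iters=20, seed=0):
    """Plain k-means; an assumed stand-in for SOM-based clustering."""
    centers = random.Random(seed).sample(points, k)
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            groups[nearest(p, centers)].append(p)
        # Keep the old center if a cluster received no points.
        centers = [mean(g) if g else c for g, c in zip(groups, centers)]
    return centers

def centroid_clf(X, y):
    """Toy base learner: predict the class whose centroid is closest."""
    cents = {c: mean([x for x, t in zip(X, y) if t == c]) for c in set(y)}
    return lambda p: min(cents, key=lambda c: dist2(p, cents[c]))

def knn_clf(k):
    """Toy base learner factory: k-nearest-neighbour vote."""
    def fit(X, y):
        def predict(p):
            near = sorted(zip(X, y), key=lambda xt: dist2(p, xt[0]))[:k]
            return Counter(t for _, t in near).most_common(1)[0][0]
        return predict
    return fit

def fit_pipeline(X, y, n_clusters=2,
                 learners=(centroid_clf, knn_clf(1), knn_clf(3))):
    """Clustering stage, then per-cluster base models, then a vote."""
    centers = kmeans(X, n_clusters)
    models = []
    for i in range(n_clusters):
        idx = [j for j, x in enumerate(X) if nearest(x, centers) == i]
        if not idx:  # empty cluster: fall back to the full dataset
            idx = list(range(len(X)))
        Xi, yi = [X[j] for j in idx], [y[j] for j in idx]
        models.append([fit(Xi, yi) for fit in learners])

    def predict(p):
        # Route the applicant to its cluster, then take a majority vote
        # (assumed stand-in for the paper's consensus model).
        votes = [m(p) for m in models[nearest(p, centers)]]
        return Counter(votes).most_common(1)[0][0]
    return predict

# Synthetic two-feature "applicants": four tight groups, two per class.
offsets = [(-0.5, -0.5), (-0.5, 0.5), (0.5, -0.5), (0.5, 0.5), (0.0, 0.0)]
blobs = [((0, 0), "good"), ((0, 4), "bad"), ((20, 4), "good"), ((20, 0), "bad")]
X = [(cx + dx, cy + dy) for (cx, cy), _label in blobs for dx, dy in offsets]
y = [label for _center, label in blobs for _off in offsets]

predict = fit_pipeline(X, y)
for applicant in [(0.1, 0.1), (0.1, 3.9), (19.9, 3.9), (19.9, 0.1)]:
    print(applicant, predict(applicant))
```

In this toy data the "good" and "bad" labels swap sides between the two regions, which is the kind of situation where routing each applicant to a cluster-specific set of models can help over a single global model.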
Pages: 301-315
Number of pages: 15