A Deep Learning Approach for Loan Default Prediction Using Imbalanced Dataset

Cited by: 2
Authors
Owusu, Ebenezer [1]
Quainoo, Richard [1]
Mensah, Solomon [1]
Appati, Justice Kwame [1]
Affiliations
[1] Univ Ghana, Accra, Ghana
Keywords
Adaptive Synthetic (ADASYN) algorithm; Deep neural network; Imbalanced dataset; Loan-default; Prediction;
DOI
10.4018/IJIIT.318672
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Lending institutions struggle to predict loan defaults accurately. Large sums lent out are never repaid, which causes substantial business losses. This study addresses loan default in online peer-to-peer lending. The data were obtained from the online Lending Club data on the Kaggle platform. Loan status was chosen as the dependent variable and divided into two discrete classes, "default" and "fully paid". The dataset is preprocessed to remove irrelevant instances. Because the dataset is imbalanced, the adaptive synthetic (ADASYN) oversampling algorithm is used to balance it by adding synthetic instances of the minority class. A deep neural network (DNN) is used for prediction. A prediction accuracy of 94.1% is achieved, the highest score across several trials with varying batch sizes and numbers of epochs. The results suggest that the proposed procedure is promising.
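The oversampling step named in the abstract can be illustrated with a minimal sketch. This is not the authors' code: it is a plain-Python rendering of the general ADASYN idea (generate more synthetic minority rows around minority points whose neighbourhoods are dominated by the majority class). The function name `adasyn`, its parameters (`minority`, `k`, `beta`, `seed`), and the list-of-lists data layout are all assumptions made for illustration. In practice a maintained implementation such as the `ADASYN` class in imbalanced-learn would normally be used instead.

```python
import math
import random


def adasyn(X, y, minority=1, k=5, beta=1.0, seed=0):
    """Illustrative ADASYN sketch: return (X_new, y_new) with synthetic
    minority rows appended. Names and defaults are hypothetical."""
    rng = random.Random(seed)
    min_idx = [i for i, label in enumerate(y) if label == minority]
    n_min = len(min_idx)
    n_maj = len(y) - n_min
    total_new = int((n_maj - n_min) * beta)  # synthetic rows to create overall
    if total_new <= 0 or n_min < 2:
        return [list(row) for row in X], list(y)

    def nearest(i, candidates):
        # Indices of up to k candidates closest to row i, excluding i itself.
        others = [j for j in candidates if j != i]
        others.sort(key=lambda j: math.dist(X[i], X[j]))
        return others[:k]

    # Difficulty ratio: share of majority rows among each minority row's
    # k nearest neighbours in the full dataset.
    ratios = []
    for i in min_idx:
        nn = nearest(i, range(len(X)))
        ratios.append(sum(y[j] != minority for j in nn) / len(nn))

    # Normalise the ratios into weights; fall back to uniform weights
    # when no minority row has a majority neighbour.
    total = sum(ratios)
    if total == 0:
        weights = [1.0 / n_min] * n_min
    else:
        weights = [r / total for r in ratios]

    # Harder rows (higher weight) receive more synthetic neighbours.
    counts = [int(round(w * total_new)) for w in weights]

    X_new, y_new = [list(row) for row in X], list(y)
    for i, count in zip(min_idx, counts):
        neighbours = nearest(i, min_idx)  # minority-only neighbours
        for _ in range(count):
            j = rng.choice(neighbours)
            lam = rng.random()  # interpolation factor in [0, 1)
            X_new.append([a + lam * (b - a) for a, b in zip(X[i], X[j])])
            y_new.append(minority)
    return X_new, y_new
```

Calling `adasyn(X, y)` on a feature matrix and label list returns the original rows followed by the synthetic ones. As is usual with oversampling, it would typically be applied to the training split only, so that evaluation data contain no synthetic rows.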
Pages: 16
Related papers
50 in total
  • [1] A deep metric learning approach for weakly supervised loan default prediction
    Zhuang, Kai
    Wu, Sen
    Gao, Xiaonan
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (04) : 5007 - 5019
  • [2] Loan Default Prediction with Deep Learning and Muddling Label Regularization
    Jiang, Weiwei
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2022, E105D (07) : 1340 - 1342
  • [3] Default Prediction for Real Estate Companies with Imbalanced Dataset
    Dong, Yuan-Xiang
    Xiao, Zhi
    Xiao, Xue
    [J]. JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2014, 10 (02) : 314 - 333
  • [4] Transfer Learning and Loan Default Prediction
    Feinberg, Tzvi
    Semenov, Alexander
    Guan, Yongpei
    Grigoriev, Dmitry
    Prokhorov, Artem
    [J]. COMPUTATIONAL DATA AND SOCIAL NETWORKS, CSONET 2021, 2021, 13116 : 387 - 388
  • [5] Deep learning approach for diabetes prediction using PIMA Indian dataset
    Naz, Huma
    Ahuja, Sachin
    [J]. JOURNAL OF DIABETES AND METABOLIC DISORDERS, 2020, 19 (01) : 391 - 403
  • [6] Deep learning approach for diabetes prediction using PIMA Indian dataset
    Huma Naz
    Sachin Ahuja
    [J]. Journal of Diabetes & Metabolic Disorders, 2020, 19 : 391 - 403
  • [7] A Naive Bayes approach to fraud prediction in loan default
    Eweoya, I. O.
    Adebiyi, A. A.
    Azeta, A. A.
    Chidozie, F.
    Agono, F. O.
    Guembe, B.
    [J]. 3RD INTERNATIONAL CONFERENCE ON SCIENCE AND SUSTAINABLE DEVELOPMENT (ICSSD 2019): SCIENCE, TECHNOLOGY AND RESEARCH: KEYS TO SUSTAINABLE DEVELOPMENT, 2019, 1299
  • [8] Explainable prediction of loan default based on machine learning models
    Zhu, Xu
    Chu, Qingyong
    Song, Xinchang
    Hu, Ping
    Peng, Lu
    [J]. Data Science and Management, 2023, 6 (03) : 123 - 133
  • [9] Impact of mortgage soft information in loan pricing on default prediction using machine learning
    Luong, Thi Mai
    Scheule, Harald
    Wanzare, Nitya
    [J]. INTERNATIONAL REVIEW OF FINANCE, 2023, 23 (01) : 158 - 186
  • [10] A hybrid machine learning approach to cerebral stroke prediction based on imbalanced medical dataset
    Liu, Tianyu
    Fan, Wenhui
    Wu, Cheng
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2019, 101