Unmasking Banking Fraud: Unleashing the Power of Machine Learning and Explainable AI (XAI) on Imbalanced Data

被引:0
|
作者
Nobel, S. M. Nuruzzaman [1 ]
Sultana, Shirin [1 ]
Singha, Sondip Poul [1 ]
Chaki, Sudipto [1 ]
Mahi, Md. Julkar Nayeen [2 ]
Jan, Tony [3 ]
Barros, Alistair [4 ]
Whaiduzzaman, Md [3 ,4 ]
机构
[1] Bangladesh Univ Business & Technol, Dept Comp Sci & Engn, Dhaka 1216, Bangladesh
[2] Daffodil Int Univ, Dept Software Engn, Dhaka 1207, Bangladesh
[3] Torrens Univ, Design & Creat Technol, Brisbane, Qld 4006, Australia
[4] Queensland Univ Technol, Sch Informat Syst, Brisbane, Qld 4000, Australia
关键词
fraud detection; machine learning; logistic regression; decision tree; SVM; XGBoost; oversampling SMOTE; SHAP analysis; LIME analysis;
D O I
10.3390/info15060298
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recognizing fraudulent activity in the banking system is essential due to the significant risks involved. When fraudulent transactions are vastly outnumbered by non-fraudulent ones, dealing with imbalanced datasets can be difficult. This study aims to determine the best model for detecting fraud by comparing four commonly used machine learning algorithms: Support Vector Machine (SVM), XGBoost, Decision Tree, and Logistic Regression. Additionally, we utilized the Synthetic Minority Over-sampling Technique (SMOTE) to address the issue of class imbalance. The XGBoost Classifier proved to be the most successful model for fraud detection, with an accuracy of 99.88%. We utilized SHAP and LIME analyses to provide greater clarity into the decision-making process of the XGBoost model and improve overall comprehension. This research shows that the XGBoost Classifier is highly effective in detecting banking fraud on imbalanced datasets, with an impressive accuracy score. The interpretability of the XGBoost Classifier model was further enhanced by applying SHAP and LIME analysis, which shed light on the significant features that contribute to fraud detection. The insights and findings presented here are valuable contributions to the ongoing efforts aimed at developing effective fraud detection systems for the banking industry.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Explainable AI (XAI) Applied in Machine Learning for Pain Modeling: A Review
    Madanu, Ravichandra
    Abbod, Maysam F.
    Hsiao, Fu-Jung
    Chen, Wei-Ta
    Shieh, Jiann-Shing
    [J]. TECHNOLOGIES, 2022, 10 (03)
  • [2] From Explainable AI to Explainable Simulation: Using Machine Learning and XAI to understand System Robustness
    Feldkamp, Niclas
    Strassburger, Steffen
    [J]. PROCEEDINGS OF THE 2023 ACM SIGSIM INTERNATIONAL CONFERENCE ON PRINCIPLES OF ADVANCED DISCRETE SIMULATION, ACMSIGSIM-PADS 2023, 2023, : 96 - 106
  • [3] An Explainable Machine Learning Pipeline for Stroke Prediction on Imbalanced Data
    Kokkotis, Christos
    Giarmatzis, Georgios
    Giannakou, Erasmia
    Moustakidis, Serafeim
    Tsatalas, Themistoklis
    Tsiptsios, Dimitrios
    Vadikolias, Konstantinos
    Aggelousis, Nikolaos
    [J]. DIAGNOSTICS, 2022, 12 (10)
  • [4] Fraud Detection in Banking Data by Machine Learning Techniques
    Hashemi, Seyedeh Khadijeh
    Mirtaheri, Seyedeh Leili
    Greco, Sergio
    [J]. IEEE ACCESS, 2023, 11 : 3034 - 3043
  • [5] Machine Learning for Prediction of Imbalanced Data: Credit Fraud Detection
    Thanh Cong Tran
    Tran Khanh Dang
    [J]. PROCEEDINGS OF THE 2021 15TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM 2021), 2021,
  • [6] Explainable Machine Learning for Fraud Detection
    Psychoula, Ismini
    Gutmann, Andreas
    Mainali, Pradip
    Lee, S. H.
    Dunphy, Paul
    Petitcolas, Fabien A. P.
    [J]. COMPUTER, 2021, 54 (10) : 49 - 59
  • [7] Automated Machine Learning and Explainable AI (AutoML-XAI) for Metabolomics: Improving Cancer Diagnostics
    Bifarin, Olatomiwa O.
    Fernandez, Facundo M.
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 2024, 35 (06) : 1089 - 1100
  • [8] Explainable Machine Learning for Trustworthy AI
    Giannotti, Fosca
    [J]. ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT, 2022, 356 : 3 - 3
  • [9] Explainable AI (XAI) in Computational Pathology Pipelines: Translating Machine Learning Features into Pathologist-Friendly Language
    Fine, Jeffrey
    Tosun, Akif
    Taylor, D. Lansing
    Becich, Michael
    Chennubhotla, S. Chakra
    [J]. MODERN PATHOLOGY, 2019, 32
  • [10] Explainable AI (XAI) in Computational Pathology Pipelines: Translating Machine Learning Features into Pathologist-Friendly Language
    Fine, Jeffrey
    Tosun, Akif
    Taylor, D. Lansing
    Becich, Michael
    Chennubhotla, S. Chakra
    [J]. LABORATORY INVESTIGATION, 2019, 99