Unmasking Banking Fraud: Unleashing the Power of Machine Learning and Explainable AI (XAI) on Imbalanced Data

被引：0

作者：

Nobel, S. M. Nuruzzaman ^{[1
]}

Sultana, Shirin ^{[1
]}

Singha, Sondip Poul ^{[1
]}

Chaki, Sudipto ^{[1
]}

Mahi, Md. Julkar Nayeen ^{[2
]}

Jan, Tony ^{[3
]}

Barros, Alistair ^{[4
]}

Whaiduzzaman, Md ^{[3
,4
]}

机构：

[1] Bangladesh Univ Business & Technol, Dept Comp Sci & Engn, Dhaka 1216, Bangladesh

[2] Daffodil Int Univ, Dept Software Engn, Dhaka 1207, Bangladesh

[3] Torrens Univ, Design & Creat Technol, Brisbane, Qld 4006, Australia

[4] Queensland Univ Technol, Sch Informat Syst, Brisbane, Qld 4000, Australia

来源：

INFORMATION | 2024年 / 15卷 / 06期

关键词：

fraud detection; machine learning; logistic regression; decision tree; SVM; XGBoost; oversampling SMOTE; SHAP analysis; LIME analysis;

D O I：

10.3390/info15060298

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Recognizing fraudulent activity in the banking system is essential due to the significant risks involved. When fraudulent transactions are vastly outnumbered by non-fraudulent ones, dealing with imbalanced datasets can be difficult. This study aims to determine the best model for detecting fraud by comparing four commonly used machine learning algorithms: Support Vector Machine (SVM), XGBoost, Decision Tree, and Logistic Regression. Additionally, we utilized the Synthetic Minority Over-sampling Technique (SMOTE) to address the issue of class imbalance. The XGBoost Classifier proved to be the most successful model for fraud detection, with an accuracy of 99.88%. We utilized SHAP and LIME analyses to provide greater clarity into the decision-making process of the XGBoost model and improve overall comprehension. This research shows that the XGBoost Classifier is highly effective in detecting banking fraud on imbalanced datasets, with an impressive accuracy score. The interpretability of the XGBoost Classifier model was further enhanced by applying SHAP and LIME analysis, which shed light on the significant features that contribute to fraud detection. The insights and findings presented here are valuable contributions to the ongoing efforts aimed at developing effective fraud detection systems for the banking industry.

引用

页数：22

共 50 条

[1] Explainable AI (XAI) Applied in Machine Learning for Pain Modeling: A Review
Madanu, Ravichandra
Abbod, Maysam F.
Hsiao, Fu-Jung
Chen, Wei-Ta
Shieh, Jiann-Shing
[J]. TECHNOLOGIES, 2022, 10 (03)
[2] From Explainable AI to Explainable Simulation: Using Machine Learning and XAI to understand System Robustness
Feldkamp, Niclas
Strassburger, Steffen
[J]. PROCEEDINGS OF THE 2023 ACM SIGSIM INTERNATIONAL CONFERENCE ON PRINCIPLES OF ADVANCED DISCRETE SIMULATION, ACMSIGSIM-PADS 2023, 2023, : 96 - 106
[3] An Explainable Machine Learning Pipeline for Stroke Prediction on Imbalanced Data
Kokkotis, Christos
Giarmatzis, Georgios
Giannakou, Erasmia
Moustakidis, Serafeim
Tsatalas, Themistoklis
Tsiptsios, Dimitrios
Vadikolias, Konstantinos
Aggelousis, Nikolaos
[J]. DIAGNOSTICS, 2022, 12 (10)
[4] Fraud Detection in Banking Data by Machine Learning Techniques
Hashemi, Seyedeh Khadijeh
Mirtaheri, Seyedeh Leili
Greco, Sergio
[J]. IEEE ACCESS, 2023, 11 : 3034 - 3043
[5] Machine Learning for Prediction of Imbalanced Data: Credit Fraud Detection
Thanh Cong Tran
Tran Khanh Dang
[J]. PROCEEDINGS OF THE 2021 15TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM 2021), 2021,
[6] Explainable Machine Learning for Fraud Detection
Psychoula, Ismini
Gutmann, Andreas
Mainali, Pradip
Lee, S. H.
Dunphy, Paul
Petitcolas, Fabien A. P.
[J]. COMPUTER, 2021, 54 (10) : 49 - 59
[7] Automated Machine Learning and Explainable AI (AutoML-XAI) for Metabolomics: Improving Cancer Diagnostics
Bifarin, Olatomiwa O.
Fernandez, Facundo M.
[J]. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 2024, 35 (06) : 1089 - 1100
[8] Explainable Machine Learning for Trustworthy AI
Giannotti, Fosca
[J]. ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT, 2022, 356 : 3 - 3
[9] Explainable AI (XAI) in Computational Pathology Pipelines: Translating Machine Learning Features into Pathologist-Friendly Language
Fine, Jeffrey
Tosun, Akif
Taylor, D. Lansing
Becich, Michael
Chennubhotla, S. Chakra
[J]. MODERN PATHOLOGY, 2019, 32
[10] Explainable AI (XAI) in Computational Pathology Pipelines: Translating Machine Learning Features into Pathologist-Friendly Language
Fine, Jeffrey
Tosun, Akif
Taylor, D. Lansing
Becich, Michael
Chennubhotla, S. Chakra
[J]. LABORATORY INVESTIGATION, 2019, 99

← 1 2 3 4 5 →