Federated learning model for credit card fraud detection with data balancing techniques

被引:0
|
作者
Mustafa Abdul Salam
Khaled M. Fouad
Doaa L. Elbably
Salah M. Elsayed
机构
[1] Benha University,Faculty of Computers and Artificial Intelligence
[2] Arab Open University,Faculty of Computer Studies
[3] New Mansoura University,Faculty of Computer Science and Engineering
[4] ElShorouk,Higher Institute for Computers & Information Technology
来源
关键词
Credit card fraud detection (CCFD); Federated learning; Data privacy; Class imbalance; Undersampling; Oversampling;
D O I
暂无
中图分类号
学科分类号
摘要
In recent years, credit card transaction fraud has resulted in massive losses for both consumers and banks. Subsequently, both cardholders and banks need a strong fraud detection system to reduce cardholder losses. Credit card fraud detection (CCFD) is an important method of fraud prevention. However, there are many challenges in developing an ideal fraud detection system for banks. First off, due to data security and privacy concerns, various banks and other financial institutions are typically not permitted to exchange their transaction datasets. These issues make traditional systems find it difficult to learn and detect fraud depictions. Therefore, this paper proposes federated learning for CCFD over different frameworks (TensorFlow federated, PyTorch). Second, there is a significant imbalance in credit card transactions across all banks, with a small percentage of fraudulent transactions outweighing the majority of valid ones. In order to demonstrate the urgent need for a comprehensive investigation of class imbalance management techniques to develop a powerful model to identify fraudulent transactions, the dataset must be balanced. In order to address the issue of class imbalance, this study also seeks to give a comparative analysis of several individual and hybrid resampling techniques. In several experimental studies, the effectiveness of various resampling techniques in combination with classification approaches has been compared. In this study, it is found that the hybrid resampling methods perform well for machine learning classification models compared to deep learning classification models. The experimental results show that the best accuracy for the Random Forest (RF); Logistic Regression; K-Nearest Neighbors (KNN); Decision Tree (DT), and Gaussian Naive Bayes (NB) classifiers are 99,99%; 94,61%; 99.96%; 99,98%, and 91,47%, respectively. The comparative results show that the RF outperforms with high performance parameters (accuracy, recall, precision and f score) better than NB; RF; DT and KNN. RF achieve the minimum loss values with all resampling techniques, and the results, when utilizing the proposed models on the entire skewed dataset, achieved preferable outcomes to the unbalanced dataset. Furthermore, the PyTorch framework achieves higher prediction accuracy for the federated learning model than the TensorFlow federated framework but with more computational time.
引用
收藏
页码:6231 / 6256
页数:25
相关论文
共 50 条
  • [31] Credit Card Fraud Detection using Deep Learning
    Shenvi, Pranali
    Samant, Neel
    Kumar, Shubham
    Kulkarni, Vaishali
    2019 IEEE 5TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2019,
  • [32] Credit Card Fraud Detection with Machine Learning Methods
    Goy, Gokhan
    Gezer, Cengiz
    Gungor, Vehbi Cagri
    2019 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2019, : 350 - 354
  • [33] Transfer Learning Strategies for Credit Card Fraud Detection
    Lebichot, Bertrand
    Verhelst, Theo
    Le Borgne, Yann-Ael
    He-Guelton, Liyun
    Oble, Frederic
    Bontempi, Gianluca
    IEEE ACCESS, 2021, 9 : 114754 - 114766
  • [34] Credit Card Fraud Detection Based on Machine Learning
    Fang, Yong
    Zhang, Yunyun
    Huang, Cheng
    CMC-COMPUTERS MATERIALS & CONTINUA, 2019, 61 (01): : 185 - 195
  • [35] Credit Card Fraud Detection Using Machine Learning
    Sailusha, Ruttala
    Gnaneswar, V
    Ramesh, R.
    Rao, G. Ramakoteswara
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS 2020), 2020, : 1264 - 1270
  • [36] Neural data mining for credit card fraud detection
    Guo, Tao
    Li, Gui-Yang
    PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 3630 - 3634
  • [37] Web service based credit card fraud detection by applying machine learning techniques
    Prusti, Debachudamani
    Rath, Santanu Kumar
    PROCEEDINGS OF THE 2019 IEEE REGION 10 CONFERENCE (TENCON 2019): TECHNOLOGY, KNOWLEDGE, AND SOCIETY, 2019, : 492 - 497
  • [38] Credit Card Fraud Detection
    Tiwari, Mohit
    Sharma, Vipul
    Bala, Devashish
    Devansh
    Kaushal, Dishant
    JOURNAL OF ALGEBRAIC STATISTICS, 2022, 13 (02) : 1778 - 1789
  • [39] Distributed data mining in credit card fraud detection
    Chan, PK
    Fan, W
    Prodromidis, AL
    Stolfo, SJ
    IEEE INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1999, 14 (06): : 67 - 74
  • [40] Distributed data mining in credit card fraud detection
    Chan, Philip K.
    Fan, Wei
    Prodromidis, Andreas L.
    Stolfo, Salvatore J.
    IEEE Intelligent Systems and Their Applications, 14 (06): : 67 - 74