An ensemble learning approach for anomaly detection in credit card data with imbalanced and overlapped classes

被引:8
|
作者
Islam, Md Amirul [1 ]
Uddin, Md Ashraf [2 ]
Aryal, Sunil [2 ]
Stea, Giovanni [1 ]
机构
[1] Univ Pisa, Dept Informat Engn, Pisa, Italy
[2] Deakin Univ, Sch Informat Technol, Geelong, Australia
关键词
Anomaly detection; Credit card; Ensemble; Meta-learning; Base learner; Classification; DECISION TREE APPROACH; FRAUD DETECTION; MACHINE; CLASSIFICATION; SUPPORT;
D O I
10.1016/j.jisa.2023.103618
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Electronic payment methods have become increasingly popular for business transactions, both online and in person, across the globe. Anomalies like online fraud and default payments, which can result in substantial financial losses, have become more common as the usage of credit cards in online purchases has increased. To address this issue, researchers have explored various machine learning models and their ensemble techniques for detecting anomalies in credit card transaction data. However, detecting anomalies in this data can be challenging due to overlapping class samples and an imbalanced class distribution. Therefore, the detection rate of anomalies from minority class samples is relatively low, and general learning algorithms can be biased towards the majority class samples. In this paper, we propose a model called Credit Card Anomaly Detection (CCAD) that leverages the base learners paradigm and meta-learning ensemble techniques to improve the detection rate of credit card anomalies. We utilize four outlier detection algorithms as base learners and XGBoost algorithm as meta learner in the proposed stacked ensemble approach to detect anomaly in credit card transactions. We apply stratified sampling technique and k-fold cross-validation process to address the issues of data imbalance and overfitting. In addition, the discordance rate is calculated to enhance the accuracy of ensemble learning performances. The proposed model is trained and tested using two datasets: CCF (Credit Card Fraud) and CCDP (Credit Card Default Payment). Experimental results demonstrate that our approach outperforms existing approaches, particularly in detecting anomalies from the minority class instances of these datasets.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] A Comprehensive Machine Learning Framework for Anomaly Detection in Credit Card Transactions
    Jeribi, Fathe
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (06) : 871 - 880
  • [22] Synthesizing class labels for highly imbalanced credit card fraud detection data
    Robert K. L. Kennedy
    Flavio Villanustre
    Taghi M. Khoshgoftaar
    Zahra Salekshahrezaee
    Journal of Big Data, 11
  • [23] An efficient fraud detection framework with credit card imbalanced data in financial services
    Aya Abd El-Naby
    Ezz El-Din Hemdan
    Ayman El-Sayed
    Multimedia Tools and Applications, 2023, 82 : 4139 - 4160
  • [24] Fraud Shield: Credit Card Fraud Detection with Ensemble and Deep Learning
    Menon, Pranav Prakash
    Sachdeva, Amit
    Gayathn, V. M.
    2024 4TH INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND SOCIAL NETWORKING, ICPCSN 2024, 2024, : 224 - 230
  • [25] Credit Card Fraud Detection using Non-Overlapped Risk based Bagging Ensemble (NRBE)
    Akila, S.
    Reddy, U. Srinivasulu
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (ICCIC), 2017, : 323 - 326
  • [26] A Hybrid Deep Learning Ensemble Model for Credit Card Fraud Detection
    Ileberi, Emmanuel
    Sun, Yanxia
    IEEE ACCESS, 2024, 12 : 175829 - 175838
  • [27] Credit Card Fraud Prediction Using XGBoost: An Ensemble Learning Approach
    Mohbey, Krishna Kumar
    Khan, Mohammad Zubair
    Indian, Ajay
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2022, 12 (02)
  • [28] Credit card fraud detection using ensemble data mining methods
    Bakhtiari, Saeid
    Nasiri, Zahra
    Vahidi, Javad
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (19) : 29057 - 29075
  • [29] Credit card fraud detection using ensemble data mining methods
    Saeid Bakhtiari
    Zahra Nasiri
    Javad Vahidi
    Multimedia Tools and Applications, 2023, 82 : 29057 - 29075
  • [30] A voting ensemble machine learning based credit card fraud detection using highly imbalance data
    Raunak Chhabra
    Shailza Goswami
    Ranjeet Kumar Ranjan
    Multimedia Tools and Applications, 2024, 83 : 54729 - 54753