Enhancing Credit Card Fraud Detection Through a Novel Ensemble Feature Selection Technique

被引:4
|
作者
Wang, Huanjing [1 ]
Liang, Qianxin [2 ]
Hancock, John T., III [2 ]
Khoshgoftaar, Taghi M. [2 ]
机构
[1] Western Kentucky Univ, Bowling Green, KY 42101 USA
[2] Florida Atlantic Univ, Boca Raton, FL USA
关键词
Ensemble Supervised Feature Selection; Ensemble Threshold-Based Feature Selection; Credit Card Fraud; Highly Class Imbalance; ALGORITHMS; MACHINE;
D O I
10.1109/IRI58017.2023.00028
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Identifying fraudulent activities in credit card transactions is an inherent component of financial computing. The focus of our research is on the Credit Card Fraud Detection Dataset, which is widely used due to its authentic transaction data. In numerous machine learning applications, feature selection has become a crucial step. To improve the chance of discovering the globally optimal feature set, we employ ensembles of feature ranking methods. These ensemble methods merge multiple feature ranking lists through a median approach. We conduct a comprehensive empirical study that examines two different ensembles of feature ranking techniques, including an ensemble of twelve threshold-based feature selection (TBFS) techniques and an ensemble of five supervised feature selection (SFS) techniques. Additionally, we present results where all features are used. We construct classification models using two Decision Tree-based classifiers, CatBoost and XGBoost, and evaluate them using two different performance metrics, the Area Under the Receiver Operating Characteristic Curve (AUC) and the Area under the Precision-Recall Curve (AUPRC). Since AUPRC provides a more accurate representation of the number of false positives, especially for highly imbalanced datasets, evaluating models for AUPRC is a wise choice. The experimental results demonstrate that the ensemble of SFS and all features performs similarly or better than the ensemble of TBFS. Moreover, we find that XGBoost outperforms CatBoost in terms of AUPRC.
引用
收藏
页码:121 / 126
页数:6
相关论文
共 50 条
  • [21] Detection of Credit Card Fraud using a Hybrid Ensemble Model
    Saraf, Sayali
    Phakatkar, Anupama
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (09) : 464 - 474
  • [22] A machine learning based credit card fraud detection using the GA algorithm for feature selection
    Emmanuel Ileberi
    Yanxia Sun
    Zenghui Wang
    Journal of Big Data, 9
  • [23] A machine learning based credit card fraud detection using the GA algorithm for feature selection
    Ileberi, Emmanuel
    Sun, Yanxia
    Wang, Zenghui
    JOURNAL OF BIG DATA, 2022, 9 (01)
  • [24] Credit Card Fraud Detection
    Tiwari, Mohit
    Sharma, Vipul
    Bala, Devashish
    Devansh
    Kaushal, Dishant
    JOURNAL OF ALGEBRAIC STATISTICS, 2022, 13 (02) : 1778 - 1789
  • [25] Implementation of Novel Approach for Credit Card Fraud Detection
    Agrawal, Ayushi
    Kumar, Shiv
    Mishra, Amit Kumar
    2015 2ND INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2015, : 1 - 4
  • [26] A soft voting ensemble learning approach for credit card fraud detection
    Mim, Mimusa Azim
    Majadi, Nazia
    Mazumder, Peal
    HELIYON, 2024, 10 (03)
  • [27] Risk based Bagged Ensemble (RBE) for Credit Card Fraud Detection
    Akila, S.
    Reddy, U. Srinivasulu
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTING AND INFORMATICS (ICICI 2017), 2017, : 670 - 674
  • [28] Application of Credit Card Fraud Detection: Based on Bagging Ensemble Classifier
    Zareapoor, Masoumeh
    Shamsolmoali, Pourya
    INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION AND CONVERGENCE (ICCC 2015), 2015, 48 : 679 - 685
  • [29] Credit card fraud detection using ensemble data mining methods
    Bakhtiari, Saeid
    Nasiri, Zahra
    Vahidi, Javad
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (19) : 29057 - 29075
  • [30] A Deep Learning Ensemble With Data Resampling for Credit Card Fraud Detection
    Mienye, Ibomoiye Domor
    Sun, Yanxia
    IEEE ACCESS, 2023, 11 : 30628 - 30638