Methods of Handling Unbalanced Datasets in Credit Card Fraud Detection

被引:5
|
作者
Minastireanu, Elena-Adriana [1 ]
Mesnita, Gabriela [2 ]
机构
[1] Alexandru Ioan Cuza Univ, Doctoral Sch Econ & Business Adm, Iasi 700057, Romania
[2] Alexandru Ioan Cuza Univ, Fac Econ & Business Adm, Business Informat Syst Dept, Iasi 700057, Romania
关键词
bank fraud; machine-learning algorithms; resampling; cost-sensitive training; unbalanced dataset; CLASSIFICATION; SMOTE;
D O I
10.18662/brain/11.1/19
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Nowadays fraudulent transactions of every type represent a major concern in the, financial industry due to the total amount of money that are lost every year. Manually analyzing fraudulent transactions is unfeasible if re think at the huge amount of data and the complexity of bank fraud in the digitization era. In this context, the problem to detect the fraud can be achieved by machine-learning algorithms due to their ability of detecting small anomalies in very large datasets. The problem that arise here is that the datasets are highly unbalanced meaning that the non-fraudulent cases heavily dominates the fraudulent ones. In this paper, we are going to present three :rays of handling unbalanced datasets by: resampling methods (undersampling and oversampling), cost :sensitive training and tree algorithms (decision tree, random forest and Naive Bays), emphasizing the idea of why the Receiver Operating Characteristics curve (ROC) should not he used on this type of datasets when measuring the performance of the algorithm. The experimental test was applied on a number of 890,977 banking transactions in order to observe the performance metrics of all the three methods mentioned above.
引用
收藏
页码:131 / 143
页数:13
相关论文
共 50 条
  • [31] Application of classification models on credit card fraud detection
    Shen, Aihua
    Tong, Rencheng
    Deng, Yaochen
    2007 INTERNATIONAL CONFERENCE ON SERVICE SYSTEMS AND SERVICE MANAGEMENT, VOLS 1-3, 2007, : 465 - +
  • [32] Neural data mining for credit card fraud detection
    Guo, Tao
    Li, Gui-Yang
    PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 3630 - 3634
  • [33] Representation Learning in Graphs for Credit Card Fraud Detection
    Van Belle, Rafael
    Mitrovic, Sandra
    De Weerdt, Jochen
    MINING DATA FOR FINANCIAL APPLICATIONS, 2020, 11985 : 32 - 46
  • [34] Credit Card Fraud Detection Using Capsule Network
    Wang, Shuo
    Liu, Guanjun
    Li, Zhenchuan
    Xuan, Shiyang
    Yan, Chungang
    Jiang, Changjun
    2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 3679 - 3684
  • [35] The Importance of Future Information in Credit Card Fraud Detection
    Nguyen, Van Bach
    Dastidar, Kanishka Ghosh
    Granitzer, Michael
    Siblini, Wissam
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
  • [36] Credit Card Fraud Detection Using Anomaly Techniques
    Sharmila, V. Ceronmani
    Kumar, Kiran R.
    Sundaram, R.
    Samyuktha, D.
    Harish, R.
    PROCEEDINGS OF 2019 1ST INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION AND COMMUNICATION TECHNOLOGY (ICIICT 2019), 2019,
  • [37] Explainable Credit Card Fraud Detection with Image Conversion
    Terzi, Duygu Sinanc
    Demirezen, Umut
    Sagiroglu, Seref
    ADCAIJ-ADVANCES IN DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE JOURNAL, 2021, 10 (01): : 63 - 76
  • [38] Comparison with Parametric Optimization in Credit Card Fraud Detection
    Gadi, Manoel Fernando Alonso
    Wang, Xidi
    do Lago, Alair Pereira
    SEVENTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2008, : 279 - +
  • [39] AutoEncoder and LightGBM for Credit Card Fraud Detection Problems
    Du, Haichao
    Lv, Li
    Guo, An
    Wang, Hongliang
    SYMMETRY-BASEL, 2023, 15 (04):
  • [40] Credit Card Fraud Detection using Deep Learning
    Shenvi, Pranali
    Samant, Neel
    Kumar, Shubham
    Kulkarni, Vaishali
    2019 IEEE 5TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2019,