Methods of Handling Unbalanced Datasets in Credit Card Fraud Detection

被引:5
|
作者
Minastireanu, Elena-Adriana [1 ]
Mesnita, Gabriela [2 ]
机构
[1] Alexandru Ioan Cuza Univ, Doctoral Sch Econ & Business Adm, Iasi 700057, Romania
[2] Alexandru Ioan Cuza Univ, Fac Econ & Business Adm, Business Informat Syst Dept, Iasi 700057, Romania
关键词
bank fraud; machine-learning algorithms; resampling; cost-sensitive training; unbalanced dataset; CLASSIFICATION; SMOTE;
D O I
10.18662/brain/11.1/19
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Nowadays fraudulent transactions of every type represent a major concern in the, financial industry due to the total amount of money that are lost every year. Manually analyzing fraudulent transactions is unfeasible if re think at the huge amount of data and the complexity of bank fraud in the digitization era. In this context, the problem to detect the fraud can be achieved by machine-learning algorithms due to their ability of detecting small anomalies in very large datasets. The problem that arise here is that the datasets are highly unbalanced meaning that the non-fraudulent cases heavily dominates the fraudulent ones. In this paper, we are going to present three :rays of handling unbalanced datasets by: resampling methods (undersampling and oversampling), cost :sensitive training and tree algorithms (decision tree, random forest and Naive Bays), emphasizing the idea of why the Receiver Operating Characteristics curve (ROC) should not he used on this type of datasets when measuring the performance of the algorithm. The experimental test was applied on a number of 890,977 banking transactions in order to observe the performance metrics of all the three methods mentioned above.
引用
收藏
页码:131 / 143
页数:13
相关论文
共 50 条
  • [21] Neural fraud detection in credit card operations
    Dorronsoro, JR
    Ginel, F
    Sanchez, C
    Cruz, CS
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 1997, 8 (04): : 827 - 834
  • [22] A Novel Framework for Credit Card Fraud Detection
    Mniai, Ayoub
    Tarik, Mouna
    Jebari, Khalid
    IEEE ACCESS, 2023, 11 : 112776 - 112786
  • [23] Ensemble Method for Credit Card Fraud Detection
    Wang, Rui
    Liu, Guanjun
    2021 4TH INTERNATIONAL CONFERENCE ON INTELLIGENT AUTONOMOUS SYSTEMS (ICOIAS 2021), 2021, : 246 - 252
  • [24] Credit Card Fraud Detection Using CNN
    Murugan, Yogamahalakshmi
    Vijayalakshmi, M.
    Selvaraj, Lavanya
    Balaraman, Saranya
    INTERNET OF THINGS AND CONNECTED TECHNOLOGIES, 2022, 340 : 194 - 204
  • [25] Bayesian Quickest Detection of Credit Card Fraud
    Buonaguidi, Bruno
    Mira, Antonietta
    Bucheli, Herbert
    Vitanis, Viton
    BAYESIAN ANALYSIS, 2022, 17 (01): : 261 - 290
  • [26] Credit Card Fraud Detection: A Case Study
    Agrawal, Ayushi
    Kumar, Shiv
    Mishra, Amit Kumar
    2015 2ND INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2015, : 5 - 7
  • [27] A Novel Approach for Credit Card Fraud Detection
    Agrawal, Ayushi
    Kumar, Shiv
    Mishra, Amit Kumar
    2015 2ND INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2015, : 8 - 11
  • [28] Anomaly Detection using Unsupervised Methods: Credit Card Fraud Case Study
    Rezapour, Mahdi
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (11) : 1 - 8
  • [29] Credit card fraud
    Walsh, FJ
    PHOTONICS SPECTRA, 1997, 31 (03) : 12 - 12
  • [30] Fraud Shield: Credit Card Fraud Detection with Ensemble and Deep Learning
    Menon, Pranav Prakash
    Sachdeva, Amit
    Gayathn, V. M.
    2024 4TH INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND SOCIAL NETWORKING, ICPCSN 2024, 2024, : 224 - 230