Modeling Insurance Fraud Detection Using Imbalanced Data Classification

被引:31
|
作者
Hassan, Amira Kamil Ibrahim [1 ,2 ]
Abraham, Ajith [1 ,3 ]
机构
[1] Sudan Univ Sci & Technol, Dept Comp Sci, Khartoum, Sudan
[2] MIR Labs, Auburn, WA USA
[3] VSB Tech Univ Ostrava, IT4Innovat, Ostrava, Czech Republic
关键词
Insurance fraud detection; Imbalanced data; Decision tree; Support vector machine and artificial neural network; AUTOMOBILE INSURANCE; CLAIMS;
D O I
10.1007/978-3-319-27400-3_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes an innovative insurance fraud detection method to deal with the imbalanced data distribution. The idea is based on building insurance fraud detection models using Decision tree (DT), Support vector machine (SVM) and Artificial Neural Network (ANN), on data partitions derived from under-sampling (with-replacement and without-replacement) of the majority class and merging it with the minority class. Throughout the paper, ten-fold cross validation method of testing is used. Its originality lies in the use of several partitioning under-sampling approaches and choosing the best. Results from a publicly available automobile insurance fraud detection data set demonstrate that DT performs slightly better than other algorithms, so DT model was used to compare between different partitioning-under-sampling approaches. Empirical results illustrate that the proposed model gave better results.
引用
收藏
页码:117 / 127
页数:11
相关论文
共 50 条
  • [1] Classification of Imbalanced Auction Fraud Data
    Ganguly, Swati
    Sadaoui, Samira
    ADVANCES IN ARTIFICIAL INTELLIGENCE, CANADIAN AI 2017, 2017, 10233 : 84 - 89
  • [2] An Effective Data Sampling Procedure for Imbalanced Data Learning on Health Insurance Fraud Detection
    Kotekani S.S.
    Velchamy I.
    Journal of Computing and Information Technology, 2020, 28 (04) : 269 - 285
  • [3] Applying MASI Algorithm to Improve the Classification Performance of Imbalanced Data in Fraud Detection
    Thi-Lich Nghiem
    Thi-Toan Nghiem
    ADVANCED COMPUTATIONAL METHODS FOR KNOWLEDGE ENGINEERING (ICCSAMA 2019), 2020, 1121 : 150 - 162
  • [4] Healthcare insurance fraud detection using data mining
    Hamid, Zain
    Khalique, Fatima
    Mahmood, Saba
    Daud, Ali
    Bukhari, Amal
    Alshemaimri, Bader
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2024, 24 (01)
  • [5] Fraud Detection in Health Insurance using Data Mining Techniques
    Rawte, Vipula
    Anuradha, G.
    2015 International Conference on Communication, Information & Computing Technology (ICCICT), 2015,
  • [6] FRAUD DETECTION USING OUTLIER PREDICTOR IN HEALTH INSURANCE DATA
    Anbarasi, M. S.
    Dhivya, S.
    2017 INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS (ICICES), 2017,
  • [7] Using Genetic Algorithm to Improve Classification of Imbalanced Datasets for credit card fraud detection
    Benchaji, Ibtissam
    Douzi, Samira
    El Ouahidi, Bouabid
    2018 2ND CYBER SECURITY IN NETWORKING CONFERENCE (CSNET), 2018,
  • [8] Medicare Fraud Detection using Random Forest with Class Imbalanced Big Data
    Bauder, Richard A.
    Khoshgoftaar, Taghi M.
    2018 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IRI), 2018, : 80 - 87
  • [9] Cryptocurrency Transaction Fraud Detection Based on Imbalanced Classification With Interpretable Analysis
    Yin, Pei
    Jiang, Wen-Long
    Ma, Zi-Jie
    Zhang, Li-Ke
    International Journal of Intelligent Information Technologies, 2024, 20 (01)
  • [10] robROSE: A robust approach for dealing with imbalanced data in fraud detection
    Baesens, Bart
    Hoeppner, Sebastiaan
    Ortner, Irene
    Verdonck, Tim
    STATISTICAL METHODS AND APPLICATIONS, 2021, 30 (03): : 841 - 861