Explainable machine learning models for Medicare fraud detection

Cited by: 5
Authors
Hancock, John T. [1]
Bauder, Richard A. [1]
Wang, Huanjing [2]
Khoshgoftaar, Taghi M. [1]
Affiliations
[1] Florida Atlantic Univ, Coll Engn & Comp Sci, Boca Raton, FL 33004 USA
[2] Western Kentucky Univ, Ogden Coll Sci & Engn, Bowling Green, KY USA
Keywords
Big Data; Class imbalance; Explainable machine learning models; Ensemble supervised feature selection; Medicare fraud detection;
DOI
10.1186/s40537-023-00821-5
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
As a means of building explainable machine learning models for Big Data, we apply a novel ensemble supervised feature selection technique. The technique is applied to publicly available insurance claims data from Medicare, the United States public health insurance program. We approach Medicare insurance fraud detection as a supervised machine learning task of anomaly detection through the classification of highly imbalanced Big Data. Our objectives for feature selection are to increase efficiency in model training and to develop more explainable machine learning models for fraud detection. Using two Big Data datasets derived from two different sources of insurance claims data, we demonstrate how our feature selection technique reduces the dimensionality of the datasets by approximately 87.5% without compromising performance. Moreover, the reduction in dimensionality results in machine learning models that are easier to explain and less prone to overfitting. Our primary contribution, the exposition of our novel feature selection technique, therefore leads to a further contribution to the application domain of automated Medicare insurance fraud detection. We use our feature selection technique to explain our fraud detection models in terms of the definitions of the selected features. The ensemble supervised feature selection technique we present is flexible in that any collection of machine learning algorithms that maintain a list of feature importance values may be used. Researchers may therefore easily employ variations of the technique we present.
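The abstract says that any collection of learners that maintain feature importance values can drive the ensemble selection, but it does not state how the per-model rankings are combined. The following is a minimal sketch of one possible combination rule, assuming a simple vote: a feature is kept if it ranks in the top `k` for at least `min_votes` models. The function names, the voting rule, the feature names, and the importance values are all hypothetical and are not taken from the paper. In practice the importance lists could come from, for example, the `feature_importances_` attribute of fitted scikit-learn tree ensembles.

```python
from collections import Counter
from typing import Dict, List, Sequence


def top_k_features(importances: Sequence[float], names: Sequence[str], k: int) -> List[str]:
    """Return the k feature names with the largest importance values."""
    ranked = sorted(zip(names, importances), key=lambda pair: pair[1], reverse=True)
    return [name for name, _ in ranked[:k]]


def ensemble_select(
    importance_lists: Dict[str, Sequence[float]],
    names: Sequence[str],
    k: int,
    min_votes: int,
) -> List[str]:
    """Keep features that rank in the top k for at least min_votes models.

    This voting rule is an illustrative assumption, not the paper's rule.
    """
    votes: Counter = Counter()
    for model_name, importances in importance_lists.items():
        if len(importances) != len(names):
            raise ValueError(f"{model_name}: expected {len(names)} importance values")
        votes.update(top_k_features(importances, names, k))
    # Preserve the original feature order in the output.
    return [name for name in names if votes[name] >= min_votes]


# Hypothetical importance values from three hypothetical models.
names = ["f0", "f1", "f2", "f3", "f4", "f5", "f6", "f7"]
importance_lists = {
    "model_a": [0.40, 0.05, 0.30, 0.02, 0.10, 0.03, 0.06, 0.04],
    "model_b": [0.35, 0.25, 0.20, 0.01, 0.05, 0.04, 0.06, 0.04],
    "model_c": [0.10, 0.05, 0.45, 0.05, 0.20, 0.05, 0.05, 0.05],
}

selected = ensemble_select(importance_lists, names, k=2, min_votes=2)
print(selected)  # ['f0', 'f2']
```

Because the sketch depends only on lists of importance values, swapping in a different set of learners or a different agreement threshold requires no change to the selection logic, which mirrors the flexibility the abstract describes.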
Pages: 31