Simulation and Detection of Healthcare Fraud in German Inpatient Claims Data

被引:0
|
作者
Schrupp, Bernhard [1 ,2 ]
Klede, Kai [1 ]
Raab, Rene [1 ]
Eskofier, Bjoern [1 ]
机构
[1] Friedrich Alexander Univ Erlangen Nurnberg, Machine Learning & Data Analyt Lab, Dept Artificial Intelligence Biomed Engn AIBE, Erlangen, Germany
[2] AOK Bayern Die Gesundheitskasse, Munich, Germany
来源
关键词
Healthcare; Inpatient Claims; Healthcare Fraud; Fraud Detection; Data Generation; Inpatient Claims Simulation;
D O I
10.1007/978-3-031-63772-8_22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The German Federal Criminal Police Office (BKA) reported damages of 72.6 million euros due to billing fraud in the German healthcare system in 2022, an increase of 25% from the previous year. However, existing literature on automated healthcare fraud detection focuses on US, Taiwanese, or private data, and detection approaches based on individual claims are virtually nonexistent. In this work, we develop machine learning methods that detect fraud in German hospital billing data. The lack of publicly available and labeled datasets limits the development of such methods. Therefore, we simulated inpatient treatments based on publicly available statistics on main and secondary diagnoses, operations and demographic information. We injected different types of fraud that were identified from the literature. This is the first complete simulator for inpatient care data, enabling further research in inpatient care. We trained and compared several Machine Learning models on the simulated dataset. Gradient Boosting and Random Forest achieved the best results with a weighted F1 measure of approximately 80%. An in-depth analysis of the presented methods shows they excel at detecting compensation-related fraud, such as DRG upcoding. An impact analysis on private inpatient claims data of a big German health insurance company revealed that up to 12% of all treatments were identified as potentially fraudulent.
引用
收藏
页码:239 / 246
页数:8
相关论文
共 50 条