UNSUPERVISED LABELING OF DATA FOR SUPERVISED LEARNING AND ITS APPLICATION TO MEDICAL CLAIMS PREDICTION

被引:5
|
作者
Ngufor, Che [1 ]
Wojtusiak, Janusz [1 ]
机构
[1] George Mason Univ, Fairfax, VA 22030 USA
来源
COMPUTER SCIENCE-AGH | 2013年 / 14卷 / 02期
关键词
unsupervised learning; concept drift; medical claims;
D O I
10.7494/csci.2013.14.2.191
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The task identifying changes and irregularities in medical insurance claim payments is a difficult process of which the traditional practice involves querying historical claims databases and flagging potential claims as normal or abnormal. Because what is considered as normal payment is usually unknown and may change over time, abnormal payments often pass undetected; only to be discovered when the payment period has passed. This paper presents the problem of on-line unsupervised learning from data streams when the distribution that generates the data changes or drifts over time. Automated algorithms for detecting drifting concepts in a probability distribution of the data are presented. The idea behind the presented drift detection methods is to transform the distribution of the data within a sliding window into a more convenient distribution. Then, a test statistics p-value at a given significance level can be used to infer the drift rate, adjust the window size and decide on the status of the drift. The detected concepts drifts are used to label the data, for subsequent learning of classification models by a supervised learner. The algorithms were tested on several synthetic and real medical claims data sets.
引用
收藏
页码:191 / 214
页数:24
相关论文
共 50 条
  • [1] Supervised an unsupervised learning approaches for the labeling of multivariate images
    Bertrand, D
    Novales, B
    Chtioui, Y
    PRECISION AGRICULTURE AND BIOLOGICAL QUALITY, 1999, 3543 : 44 - 52
  • [2] Improving Landslides Prediction: Meteorological Data Preprocessing Based on Supervised and Unsupervised Learning
    Guerrero-Rodriguez, Byron
    Salvador-Meneses, Jaime
    Garcia-Rodriguez, Jose
    Mejia-Escobar, Christian
    CYBERNETICS AND SYSTEMS, 2024, 55 (06) : 1332 - 1356
  • [3] Semi-Supervised Prediction of Comorbid Rare Conditions Using Medical Claims Data
    Nagpal, Chirag
    Miller, Kyle
    Pellathy, Tiffany
    Hravnak, Marilyn
    Clermont, Gilles
    Pinsky, Michael
    Dubrawski, Artur
    2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2017), 2017, : 478 - 485
  • [4] Combining supervised and unsupervised learning for data clustering
    Paolo Corsini
    Beatrice Lazzerini
    Francesco Marcelloni
    Neural Computing & Applications, 2006, 15 : 289 - 297
  • [5] Combining supervised and unsupervised learning for data clustering
    Corsini, Paolo
    Lazzerini, Beatrice
    Marcelloni, Francesco
    NEURAL COMPUTING & APPLICATIONS, 2006, 15 (3-4): : 289 - 297
  • [6] Unsupervised Selective Labeling for More Effective Semi-supervised Learning
    Wang, Xudong
    Lian, Long
    Yu, Stella X.
    COMPUTER VISION - ECCV 2022, PT XXX, 2022, 13690 : 427 - 445
  • [7] Supervised and Unsupervised Deep Learning Approaches for EEG Seizure Prediction
    Georgis-Yap, Zakary
    Popovic, Milos R.
    Khan, Shehroz S.
    JOURNAL OF HEALTHCARE INFORMATICS RESEARCH, 2024, 8 (02) : 286 - 312
  • [8] Combining Supervised and Unsupervised Learning for Improved miRNA Target Prediction
    Sedaghat, Nafiseh
    Fathy, Mahmood
    Modarressi, Mohammad Hossein
    Shojaie, Ali
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2018, 15 (05) : 1594 - 1604
  • [9] Supervised and Unsupervised Deep Learning Approaches for EEG Seizure Prediction
    Zakary Georgis-Yap
    Milos R. Popovic
    Shehroz S. Khan
    Journal of Healthcare Informatics Research, 2024, 8 : 286 - 312
  • [10] Improving facies prediction by combining supervised and unsupervised learning methods
    Ippolito, Marco
    Ferguson, John
    Jenson, Fred
    JOURNAL OF PETROLEUM SCIENCE AND ENGINEERING, 2021, 200