UNSUPERVISED LABELING OF DATA FOR SUPERVISED LEARNING AND ITS APPLICATION TO MEDICAL CLAIMS PREDICTION

被引:5
|
作者
Ngufor, Che [1 ]
Wojtusiak, Janusz [1 ]
机构
[1] George Mason Univ, Fairfax, VA 22030 USA
来源
COMPUTER SCIENCE-AGH | 2013年 / 14卷 / 02期
关键词
unsupervised learning; concept drift; medical claims;
D O I
10.7494/csci.2013.14.2.191
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The task identifying changes and irregularities in medical insurance claim payments is a difficult process of which the traditional practice involves querying historical claims databases and flagging potential claims as normal or abnormal. Because what is considered as normal payment is usually unknown and may change over time, abnormal payments often pass undetected; only to be discovered when the payment period has passed. This paper presents the problem of on-line unsupervised learning from data streams when the distribution that generates the data changes or drifts over time. Automated algorithms for detecting drifting concepts in a probability distribution of the data are presented. The idea behind the presented drift detection methods is to transform the distribution of the data within a sliding window into a more convenient distribution. Then, a test statistics p-value at a given significance level can be used to infer the drift rate, adjust the window size and decide on the status of the drift. The detected concepts drifts are used to label the data, for subsequent learning of classification models by a supervised learner. The algorithms were tested on several synthetic and real medical claims data sets.
引用
收藏
页码:191 / 214
页数:24
相关论文
共 50 条
  • [41] Unsupervised clustering methods for medical data: An application to thyroid gland data
    Albayrak, S
    ARTIFICAIL NEURAL NETWORKS AND NEURAL INFORMATION PROCESSING - ICAN/ICONIP 2003, 2003, 2714 : 695 - 701
  • [42] Link Weight Prediction Using Supervised Learning Methods and Its Application to Yelp Layered Network
    Fu, Chenbo
    Zhao, Minghao
    Fan, Lu
    Chen, Xinyi
    Chen, Jinyin
    Wu, Zhefu
    Xia, Yongxiang
    Xuan, Qi
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (08) : 1507 - 1518
  • [43] Combining unsupervised and supervised learning techniques for enhancing the performance of functional data classifiers
    Maturo, Fabrizio
    Verde, Rosanna
    COMPUTATIONAL STATISTICS, 2024, 39 (01) : 239 - 270
  • [44] Categorizing Driving Patterns based on Telematics Data Using Supervised and Unsupervised Learning
    Narwani, Bhumika
    Muchhala, Yash
    Nawani, Jatin
    Pawar, Renuka
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS 2020), 2020, : 302 - 306
  • [45] Data mining application in prosecution committee for unsupervised learning
    Liu, P
    Zhu, JX
    Liu, LJ
    Li, YH
    Zhang, XF
    2005 INTERNATIONAL CONFERENCE ON SERVICES SYSTEMS AND SERVICES MANAGEMENT, VOLS 1 AND 2, PROCEEDINGS, 2005, : 1061 - 1064
  • [46] Combining unsupervised and supervised learning techniques for enhancing the performance of functional data classifiers
    Fabrizio Maturo
    Rosanna Verde
    Computational Statistics, 2024, 39 : 239 - 270
  • [47] A Bayes-true data generator for evaluation of supervised and unsupervised learning methods
    Frasch, Janick V.
    Lodwich, Aleksander
    Shafait, Faisal
    Breuel, Thomas M.
    PATTERN RECOGNITION LETTERS, 2011, 32 (11) : 1523 - 1531
  • [48] Prediction of SGEMM GPU Kernel Performance using Supervised and Unsupervised Machine Learning Techniques
    Agrawal, Sanket
    Bansal, Akshay
    Rathor, Sandeep
    2018 9TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2018,
  • [49] Learning from Privacy Preserved Encrypted Data on Cloud Through Supervised and Unsupervised Machine Learning
    Khan, Ahmad Neyaz
    Fan, Ming Yu
    Malik, Asad
    Memon, Raheel Ahmed
    2019 2ND INTERNATIONAL CONFERENCE ON COMPUTING, MATHEMATICS AND ENGINEERING TECHNOLOGIES (ICOMET), 2019,
  • [50] An unsupervised learning approach to resolving the data imbalanced issue in supervised learning problems in functional genomics
    Yoon, K
    Kwek, S
    HIS 2005: 5TH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS, PROCEEDINGS, 2005, : 303 - 308