Data mining a diabetic data warehouse

被引:79
|
作者
Breault, JL
Goodall, CR
Fos, PJ
机构
[1] Alton Ochsner Med Fdn & Ochsner Clin, New Orleans, LA 70121 USA
[2] Tulane Univ, New Orleans, LA 70112 USA
[3] AT&T Corp, Shannon Res & Technol Lab, Middletown, NJ 07748 USA
[4] Tulane Univ, New Orleans, LA 70112 USA
[5] Univ Nevada, Sch Dent, Las Vegas, NV 89154 USA
关键词
data mining; diabetes; data mining software CART;
D O I
10.1016/S0933-3657(02)00051-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Diabetes is a major health problem in the United States. There is a long history of diabetic registries and databases with systematically collected patient information. We examine one such diabetic data warehouse, showing a method of applying data mining techniques, and some of the data issues, analysis problems, and results. The diabetic data warehouse is from a large integrated health care system in the New Orleans area with 30,383 diabetic patients. Methods for translating a complex relational database with time series and sequencing information to a flat file suitable for data mining are challenging. We discuss two variables in detail, a comorbidity index and the HgbA1c, a measure of glycemic control related to outcomes. We used the classification tree approach in Classification and Regression Trees (CART((R))) with a binary target variable of HgbA1c >9.5 and 10 predictors: age, sex, emergency department visits, office visits, comorbidity index, dyslipidemia, hypertension, cardiovascular disease, retinopathy, end-stage renal disease. Unexpectedly, the most important variable associated with bad glycemic control is younger age, not the comorbiditity index or whether patients have related diseases. If we want to target diabetics with bad HgbA1c values, the odds of finding them is 3.2 times as high in those <6.5 years of age than those older. Data mining can discover novel associations that are useful to clinicians and administrators. (C) 2002 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:37 / 54
页数:18
相关论文
共 50 条
  • [1] The data warehouse and data mining
    Inmon, WH
    [J]. COMMUNICATIONS OF THE ACM, 1996, 39 (11) : 49 - 50
  • [2] Data mining a diabetic data warehouse (vol 26, pg 37, 2002)
    Breault, JL
    Goodall, CR
    Fos, PJ
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2003, 27 (02) : 227 - 227
  • [3] Data warehouse and data mining in a surgical clinic
    Tusch, G
    Müller, M
    Rohwer-Mensching, K
    Heiringhoff, K
    Klempnauer, J
    [J]. MEDICAL INFOBAHN FOR EUROPE, PROCEEDINGS, 2000, 77 : 784 - 789
  • [4] The Establishment and Data Mining of Meteorological Data Warehouse
    Shao, Lei
    Liu, Jun
    Dong, Guoling
    Mu, Yi
    Guo, Peng
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (IEEE ICMA 2014), 2014, : 2049 - 2054
  • [5] The Applications of Data Mining in Tax Data Warehouse
    Tao, Wang
    Ning, Guo
    [J]. PROCEEDINGS OF THE 2009 INTERNATIONAL CONFERENCE ON SOFTWARE TECHNOLOGY AND ENGINEERING, 2009, : 103 - +
  • [6] The study and application of data mining and data warehouse in CIMS
    Zhou, LJ
    Liu, C
    Liu, DX
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY: TOOLS AND TECHNOLOGY V, 2003, 5098 : 272 - 278
  • [7] Study of MIS based on data warehouse & data mining
    Zhao, JH
    [J]. PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT, 2000, : 328 - 330
  • [8] An evolutive algorithm for the data mining in population data warehouse
    Yang, Wenchuan
    Wang, Peng
    Gao, Chunyang
    Fan, Yanyang
    Luan, Huahua
    [J]. PROCEEDINGS OF THE FIFTH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS, VOLS 1 AND 2, 2006, : 881 - 885
  • [9] Application of Data Warehouse and Data Mining in Landscape Research
    Duan Ran
    Yang Chunyu
    Su Jiafu
    [J]. PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND SYSTEM (ICISS 2018), 2018, : 112 - 113
  • [10] Data Warehouse and Data Mining Based on Marine Environment
    Dong, Xiaowei
    [J]. JOURNAL OF COASTAL RESEARCH, 2020, : 394 - 397