Data mining a diabetic data warehouse

被引:79
|
作者
Breault, JL
Goodall, CR
Fos, PJ
机构
[1] Alton Ochsner Med Fdn & Ochsner Clin, New Orleans, LA 70121 USA
[2] Tulane Univ, New Orleans, LA 70112 USA
[3] AT&T Corp, Shannon Res & Technol Lab, Middletown, NJ 07748 USA
[4] Tulane Univ, New Orleans, LA 70112 USA
[5] Univ Nevada, Sch Dent, Las Vegas, NV 89154 USA
关键词
data mining; diabetes; data mining software CART;
D O I
10.1016/S0933-3657(02)00051-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Diabetes is a major health problem in the United States. There is a long history of diabetic registries and databases with systematically collected patient information. We examine one such diabetic data warehouse, showing a method of applying data mining techniques, and some of the data issues, analysis problems, and results. The diabetic data warehouse is from a large integrated health care system in the New Orleans area with 30,383 diabetic patients. Methods for translating a complex relational database with time series and sequencing information to a flat file suitable for data mining are challenging. We discuss two variables in detail, a comorbidity index and the HgbA1c, a measure of glycemic control related to outcomes. We used the classification tree approach in Classification and Regression Trees (CART((R))) with a binary target variable of HgbA1c >9.5 and 10 predictors: age, sex, emergency department visits, office visits, comorbidity index, dyslipidemia, hypertension, cardiovascular disease, retinopathy, end-stage renal disease. Unexpectedly, the most important variable associated with bad glycemic control is younger age, not the comorbiditity index or whether patients have related diseases. If we want to target diabetics with bad HgbA1c values, the odds of finding them is 3.2 times as high in those <6.5 years of age than those older. Data mining can discover novel associations that are useful to clinicians and administrators. (C) 2002 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:37 / 54
页数:18
相关论文
共 50 条
  • [41] Spatial Data Mining of a Population-Based Data Warehouse of Cancer in Mexico
    Perez-Ortega, Joaquin
    Miranda-Henriques, Fatima
    Reyes-Salgado, Gerardo
    Santaolaya-Salgado, Rene
    Pazos-Rangel, Rodolfo A.
    Mexicano-Santoyo, Adriana
    [J]. INTERNATIONAL JOURNAL OF COMBINATORIAL OPTIMIZATION PROBLEMS AND INFORMATICS, 2010, 1 (01): : 61 - 67
  • [42] Mining association rule efficiently based on data warehouse
    Chen, XH
    Lai, BC
    Luo, D
    [J]. JOURNAL OF CENTRAL SOUTH UNIVERSITY OF TECHNOLOGY, 2003, 10 (04): : 375 - 380
  • [43] A proposal of integrating data mining and on-line analytical processing in data warehouse
    Liu, Z
    Guo, MY
    [J]. 2001 INTERNATIONAL CONFERENCES ON INFO-TECH AND INFO-NET PROCEEDINGS, CONFERENCE A-G: INFO-TECH & INFO-NET: A KEY TO BETTER LIFE, 2001, : C146 - C151
  • [44] The Development of Data Warehouse to Support Data Mining Technique for Traffic Accident Prediction
    Budiawan, Wiwik
    Saptadi, Singgih
    Arvianto, Ary
    [J]. 3RD INTERNATIONAL CONFERENCE ON ENERGY, ENVIRONMENTAL AND INFORMATION SYSTEM (ICENIS 2018), 2018, 73
  • [45] The Data Mining of the Human Resources Data Warehouse in University Based on Association Rule
    Zhang Danping
    Deng Jin
    [J]. JOURNAL OF COMPUTERS, 2011, 6 (01) : 139 - 146
  • [46] A new classification mining model based on the data warehouse
    Zhang, SL
    Zhang, JF
    [J]. 2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 168 - 171
  • [47] Mining association rule efficiently based on data warehouse
    陈晓红
    赖邦传
    罗铤
    [J]. Journal of Central South University, 2003, (04) : 375 - 380
  • [48] Development of Data Warehouse For Leishmaniasis and Deployment of Data mining Process To Make Decision
    Mejhed, Habiba
    Boussaa, Samia
    Mejhed, Nour El Houda
    [J]. ACS'09: PROCEEDINGS OF THE 9TH WSEAS INTERNATIONAL CONFERENCE ON APPLIED COMPUTER SCIENCE, 2009, : 30 - 39
  • [49] Application of data warehouse and data mining in the steel enterprise information integration system
    Pei, Shenglei
    Jia, Guoqing
    [J]. 2014 SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC), VOL 2, 2014, : 181 - 184
  • [50] Validation of data from electronic data warehouse in diabetic ketoacidosis: Caution is needed
    VanderWeele, Jennifer
    Pollack, Teresa
    Oakes, Diana Johnson
    Smyrniotis, Colleen
    Illuri, Vidhya
    Vellanki, Priyathama
    O'Leary, Kevin
    Holl, Jane
    Aleppo, Grazia
    Molitch, Mark E.
    Walli, Amisha
    [J]. JOURNAL OF DIABETES AND ITS COMPLICATIONS, 2018, 32 (07) : 650 - 654