Utilizing electronic health record data to understand comorbidity burden among people living with HIV: a machine learning approach

被引:15
|
作者
Yang, Xueying [1 ,2 ]
Zhang, Jiajia [1 ,3 ]
Chen, Shujie [1 ,3 ]
Weissman, Sharon [4 ]
Olatosi, Bankole [1 ,5 ]
Li, Xiaoming [1 ,2 ]
机构
[1] Univ South Carolina, South Carolina SmartState Ctr Healthcare Qual, Columbia, SC 29208 USA
[2] Univ South Carolina, Arnold Sch Publ Hlth, Dept Hlth Promot Educ & Behav, Columbia, SC 29208 USA
[3] Univ South Carolina, Dept Epidemiol & Biostat, Arnold Sch Publ Hlth, Columbia, SC 29208 USA
[4] Univ South Carolina, Dept Internal Med, Sch Med, Columbia, SC 29208 USA
[5] Univ South Carolina, Arnold Sch Publ Hlth, Dept Hlth Serv Policy & Management, Columbia, SC 29208 USA
基金
美国国家卫生研究院;
关键词
comorbidity; electronic health record; HIV; AIDS; lasso regression; machine learning; South Carolina; ACTIVE ANTIRETROVIRAL THERAPY; AGE-RELATED COMORBIDITIES; CHRONIC KIDNEY-DISEASE; INFECTED PATIENTS; FAMILY-HISTORY; CLINICAL COMORBIDITY; SOUTH-CAROLINA; MYOCARDIAL-INFARCTION; CIGARETTE-SMOKING; RISK PREDICTION;
D O I
10.1097/QAD.0000000000002736
中图分类号
R392 [医学免疫学]; Q939.91 [免疫学];
学科分类号
100102 ;
摘要
Objectives: An understanding of the predictors of comorbidity among people living with HIV (PLWH) is critical for effective HIV care management. In this study, we identified predictors of comorbidity burden among PLWH based on machine learning models with electronic health record (EHR) data. Methods: The study population are individuals with a HIV diagnosis between January 2005 and December 2016 in South Carolina (SC). The change of comorbidity burden, represented by the Charlson Comorbidity Index (CCI) score, was measured by the score difference between pre- and post-HIV diagnosis, and dichotomized into a binary outcome variable. Thirty-five risk predictors from multiple domains were used to predict the increase in comorbidity burden based on the logistic least absolute shrinkage and selection operator (Lasso) regression analysis using 80% data for model development and 20% data for validation. Results: Of 8253 PLWH, the mean value of the CCI score difference was 0.8 +/- 1.9 (range from 0 to 21) with 2328 (28.2%) patients showing an increase in CCI score after HIV diagnosis. Top predictors for an increase in CCI score using the LASSO model included older age at HIV diagnosis, positive family history of chronic conditions, tobacco use, longer duration with retention in care, having PEBA insurance, having low recent CD4(+) cell count and duration of viral suppression. Conclusion: The application of machine learning methods to EHR data could identify important predictors of increased comorbidity burden among PLWH with high accuracy. Results may enhance the understanding of comorbidities and provide the evidence based data for integrated HIV and comorbidity care management of PLWH.
引用
收藏
页码:S39 / S51
页数:13
相关论文
共 50 条
  • [31] Towards a Universal Privacy Model for Electronic Health Record Systems: An Ontology and Machine Learning Approach
    Nowrozy, Raza
    Ahmed, Khandakar
    Wang, Hua
    Mcintosh, Timothy
    INFORMATICS-BASEL, 2023, 10 (03):
  • [32] A Phenotyping Algorithm to Identify People With HIV in Electronic Health Record Data (HIV-Phen): Development and Evaluation Study
    May, Sarah B.
    Giordano, Thomas P.
    Gottlieb, Assaf
    JMIR FORMATIVE RESEARCH, 2021, 5 (11)
  • [33] Predicting Intensive Care Unit Readmission with Machine Learning Using Electronic Health Record Data
    Rojas, J. C.
    Carey, K. A.
    Edelson, D. P.
    Venable, L. R.
    Howell, M. D.
    Churpek, M. M.
    AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2018, 197
  • [34] Postoperative delirium prediction using machine learning models and preoperative electronic health record data
    Andrew Bishara
    Catherine Chiu
    Elizabeth L. Whitlock
    Vanja C. Douglas
    Sei Lee
    Atul J. Butte
    Jacqueline M. Leung
    Anne L. Donovan
    BMC Anesthesiology, 22
  • [35] Using Natural Language Processing and Machine Learning to Identify Opioids in Electronic Health Record Data
    McDermott, Sean P.
    Wasan, Ajay D.
    JOURNAL OF PAIN RESEARCH, 2023, 16 : 2133 - 2140
  • [36] Prediction of Atherosclerotic Cardiovascular Disease Risk Using Machine Learning and Electronic Health Record Data
    Ward, Andrew
    Sarraju, Ashish
    Chung, Sukyung
    Palaniappan, Latha
    Scheinker, David
    Rodriguez, Fatima
    CIRCULATION, 2019, 140
  • [37] Postoperative delirium prediction using machine learning models and preoperative electronic health record data
    Bishara, Andrew
    Chiu, Catherine
    Whitlock, Elizabeth L.
    Douglas, Vanja C.
    Lee, Sei
    Butte, Atul J.
    Leung, Jacqueline M.
    Donovan, Anne L.
    BMC ANESTHESIOLOGY, 2022, 22 (01)
  • [38] Predicting Intensive Care Unit Readmission with Machine Learning Using Electronic Health Record Data
    Rojas, Juan C.
    Carey, Kyle A.
    Edelson, Dana P.
    Venable, Laura R.
    Howell, Michael D.
    Churpek, Matthew M.
    ANNALS OF THE AMERICAN THORACIC SOCIETY, 2018, 15 (07) : 846 - 853
  • [39] Utilizing Electronic Health Record Information to Optimize Medication Infusion Devices: A Manual Data Integration Approach
    Chuk, Amanda
    Maloney, Robert
    Gawron, Joyce
    Skinner, Colin
    JOURNAL FOR HEALTHCARE QUALITY, 2016, 38 (06) : 370 - 378
  • [40] A machine learning approach to identify distinct subgroups of veterans at risk for hospitalization or death using administrative and electronic health record data
    Parikh, Ravi B.
    Linn, Kristin A.
    Yan, Jiali
    Maciejewski, Matthew L.
    Rosland, Ann-Marie
    Volpp, Kevin G.
    Groeneveld, Peter W.
    Navathe, Amol S.
    PLOS ONE, 2021, 16 (02):