Simulation of a machine learning enabled learning health system for risk prediction using synthetic patient data

Cited by: 0
Authors
Anjun Chen
Drake O. Chen
Affiliations
[1] LHS Technology Forum Initiative
[2] Learning Health Community
Abstract
When enabled by machine learning (ML), learning health systems (LHS) hold promise for improving the effectiveness of healthcare delivery. One major barrier to LHS research and development is the lack of access to electronic health record (EHR) patient data. To overcome this barrier, this study demonstrated the feasibility of developing a simulated ML-enabled LHS using synthetic patient data. The ML-enabled LHS was initialized with a dataset of 30,000 synthetic Synthea patients and an XGBoost base model for lung cancer risk prediction. Four additional datasets of 30,000 patients each were generated and added sequentially to the cumulative dataset to simulate the arrival of new patients, yielding datasets of 60,000, 90,000, 120,000 and 150,000 patients. A new XGBoost model was built at each step, and performance improved as the data size increased, reaching 0.936 recall and 0.962 AUC (area under the curve) on the 150,000-patient dataset. The effectiveness of the new ML-enabled LHS process was verified by implementing XGBoost models for stroke risk prediction on the same Synthea patient populations. By making the ML code and synthetic patient data publicly available for testing and training, this first synthetic LHS process paves the way for more researchers to start developing LHS with real patient data.
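The process the abstract describes (start with one batch of patients, append a new batch at each step, rebuild the model on the cumulative data, and track recall and AUC) can be sketched roughly as follows. This is a minimal illustration, not the authors' code. To stay dependency-free it swaps in a small logistic regression trained by stochastic gradient descent in place of XGBoost, scales the batch size down from 30,000 to 300, and uses two hypothetical features (`age`, `smoker`) with an invented risk formula instead of Synthea records. It also assumes that AUC means area under the ROC curve and that each model is scored on a fixed held-out set; the abstract does not state the evaluation protocol.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def make_batch(n, rng):
    """Generate n hypothetical patients as ((age_scaled, smoker), label)."""
    batch = []
    for _ in range(n):
        age = rng.uniform(30.0, 90.0)
        smoker = 1.0 if rng.random() < 0.3 else 0.0
        # Assumed toy risk model for illustration; not taken from the study.
        risk = sigmoid(-5.0 + 0.05 * age + 1.5 * smoker)
        label = 1 if rng.random() < risk else 0
        batch.append(((age / 100.0, smoker), label))
    return batch


def train_logistic(rows, epochs=20, lr=0.1):
    """Fit a logistic regression by plain SGD (stand-in for XGBoost)."""
    w0, w1, b = 0.0, 0.0, 0.0
    for _ in range(epochs):
        for (x0, x1), y in rows:
            err = sigmoid(w0 * x0 + w1 * x1 + b) - y
            w0 -= lr * err * x0
            w1 -= lr * err * x1
            b -= lr * err
    return w0, w1, b


def predict(model, rows):
    """Return a predicted risk score for every row."""
    w0, w1, b = model
    return [sigmoid(w0 * x0 + w1 * x1 + b) for (x0, x1), _ in rows]


def recall(labels, scores, threshold=0.5):
    """True positives divided by all actual positives."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    if not positives:
        return 0.0
    return sum(s >= threshold for s in positives) / len(positives)


def roc_auc(labels, scores):
    """Probability that a random positive outscores a random negative."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need both classes to compute AUC")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def run_simulation(batch_size=300, n_batches=5, test_size=400, seed=0):
    rng = random.Random(seed)
    test_rows = make_batch(test_size, rng)  # fixed held-out set (assumption)
    test_labels = [y for _, y in test_rows]
    cumulative, results = [], []
    for _ in range(n_batches):
        cumulative.extend(make_batch(batch_size, rng))  # new patients arrive
        model = train_logistic(cumulative)              # rebuild on all data
        scores = predict(model, test_rows)
        results.append({
            "n_patients": len(cumulative),
            "recall": recall(test_labels, scores),
            "auc": roc_auc(test_labels, scores),
        })
    return results


results = run_simulation()
for r in results:
    print(r)
```

Each printed row reports the cumulative patient count and the two metrics for the model rebuilt at that step. The toy model and data are not expected to reproduce the reported 0.936 recall or 0.962 AUC; the sketch only shows the shape of the update loop.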
Related papers
  • [1] Simulation of a machine learning enabled learning health system for risk prediction using synthetic patient data
    Chen, Anjun
    Chen, Drake O.
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)
  • [2] Cardiovascular disease risk prediction via machine learning using mental health data
    Dorraki, M.
    Liao, Z.
    Abbott, D.
    Psaltis, P. J.
    Baker, E.
    Bidargaddi, N.
    Van Den Hengel, A.
    Narula, J.
    Verjans, J. W.
    [J]. EUROPEAN HEART JOURNAL, 2022, 43 : 2784 - 2784
  • [3] Prediction of Human Health using Machine Learning and Big Data
    Fahad, P. K.
    Pallavi, M. S.
    [J]. PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), 2018, : 195 - 199
  • [4] IoT Enabled Crop Prediction and Irrigation Automation System Using Machine Learning
    Kumar, Raj
    Singhal, Vivek
    [J]. Recent Advances in Computer Science and Communications, 2022, 15 (01) : 88 - 97
  • [5] Trends in Using IoT with Machine Learning in Health Prediction System
    Aldahiri, Amani
    Alrashed, Bashair
    Hussain, Walayat
    [J]. FORECASTING, 2021, 3 (01) : 181 - 206
  • [6] Chaotic System Prediction Using Data Assimilation and Machine Learning
    Guo Yanan
    Cao Xiaoqun
    Peng Kecheng
    [J]. 2020 INTERNATIONAL CONFERENCE ON ENERGY, ENVIRONMENT AND BIOENGINEERING (ICEEB 2020), 2020, 185
  • [7] Machine Learning for Prediction in Electronic Health Data
    Rose, Sherri
    [J]. JAMA NETWORK OPEN, 2018, 1 (04)
  • [8] Prediction of Atherosclerotic Cardiovascular Disease Risk Using Machine Learning and Electronic Health Record Data
    Ward, Andrew
    Sarraju, Ashish
    Chung, Sukyung
    Palaniappan, Latha
    Scheinker, David
    Rodriguez, Fatima
    [J]. CIRCULATION, 2019, 140
  • [9] Machine Learning Approaches for Prediction of Facial Rejuvenation Using Real and Synthetic Data
    Shah, Syed Afaq Ali
    Bennamoun, Mohammed
    Molton, Michael K.
    [J]. IEEE ACCESS, 2019, 7 : 23779 - 23787
  • [10] Road Car Accident Prediction Using a Machine-Learning-Enabled Data Analysis
    Ardakani, Saeid Pourroostaei
    Liang, Xiangning
    Mengistu, Kal Tenna
    So, Richard Sugianto
    Wei, Xuhui
    He, Baojie
    Cheshmehzangi, Ali
    [J]. SUSTAINABILITY, 2023, 15 (07)