A potential predictive model based on machine learning and CPD parameters in elderly patients with aplastic anemia and myelodysplastic neoplasms

被引:0
|
作者
Qi, Yuxiang [1 ,2 ]
Liu, Xu [2 ,3 ]
Ding, Zhishan [1 ]
Yu, Ying [2 ]
Zhuang, Zhenchao [4 ]
机构
[1] Zhejiang Chinese Med Univ, Sch Med Technol & Informat Engn, Hangzhou, Peoples R China
[2] Zhejiang Chinese Med Univ, Zhejiang Prov Hosp Chinese Med, Dept Lab Med, Affiliated Hosp 1, Hangzhou, Peoples R China
[3] Zhejiang Chinese Med Univ, Sch Clin Med 1, Hangzhou, Peoples R China
[4] Adicon Clin Labs, Hangzhou 310023, Zhejiang, Peoples R China
关键词
Cell population data; Parameters; Aplastic anemia; Myelodysplastic neoplasms; Machine learning;
D O I
10.1186/s12911-024-02781-z
中图分类号
R-058 [];
学科分类号
摘要
BackgroundAplastic anemia (AA) and myelodysplastic neoplasms (MDS) have similar peripheral blood manifestations and are clinically characterized by reduced hematological triad. It is challenging to distinguish and diagnose these two diseases. Hence, utilizing machine learning methods, we employed and validated an algorithm that used cell population data (CPD) parameters to diagnose AA and MDS.MethodsIn this study, CPD parameters were obtained from the Beckman Coulter DxH800 analyzer for 160 individuals diagnosed with AA or MDS through a comprehensive retrospective analysis. The individuals were unselectively assigned to a training cohort (77%) and a testing cohort (23%). Additionally, an external validation cohort consisting of eighty-six elderly patients with AA and MDS from two additional centers was established. The discriminative parameters were carefully analyzed through univariate analysis, and the most predictive variables were selected using least absolute shrinkage and selection operator (LASSO) regression. Six machine learning algorithms were utilized to compare the performance of forecasting AA and MDS patients. The area under the curves (AUCs), calibration curves, decision curves analysis (DCA), and shapley additive explanations (SHAP) plots were employed to interpret and assess the model's predictive accuracy, clinical utility, and stability.ResultsAfter the comparative evaluation of various models, the logistic regression model emerged as the most suitable machine learning model for predicting the probability of AA and MDS, which utilized five principal variables (age, MNVLY, SDVLY, MNLALSEGC, and MNCEGC) to accurately estimate the risk of these diseases. The best model delivered an AUC of 0.791 in the testing cohort and had a high specificity (0.850) and positive predictive value (0.818). Furthermore, the calibration curve indicated excellent agreement between actual and predicted probabilities. The DCA curve further supported the clinical utility of our model and offered significant clinical advantages in guiding treatment decisions. Moreover, the model's performance was consistent in an external validation group, with an AUC of 0.719.ConclusionsWe developed a novel model that effectively distinguished elderly patients with AA and MDS, which had the potential to provide physicians assistance in early diagnosis and the proper treatment for the elderly.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] The outcome of allogeneic hematopoietic stem cell transplantation among elderly patients with severe aplastic anemia and a predictive model from the Chinese Blood and Marrow Transplant Registry group
    Xu, Zheng-Li
    Xu, Lan -Ping
    Zhang, Yi-Cheng
    Zhou, Yu -Hong
    Jiang, Er -Lie
    Zhang, Jian-Ping
    Fu, Bin
    Ouyang, Gui-Fang
    Song, Xian-Min
    Zhang, Xue-Jun
    Dong, Yu -Jun
    Li, Nai-Nong
    Wang, Ling
    Zhang, Xi
    He, Peng-Cheng
    Kong, Fan-Sheng
    Liu, Hui-Xia
    Liu, Li
    Liu, Lin
    Xiao, Tai -Wu
    Xu, Wen -Wei
    Xu, Xiao-Jun
    Yuan, Guo-Lin
    Yi, Hai
    Yu, Dan
    Yu, Li
    Huang, Xiao-Jun
    HAEMATOLOGICA, 2024, 109 (06) : 2000 - 2004
  • [22] Collaborative Production Planning with Unknown Parameters using Model Predictive Control and Machine Learning
    Chen, Yiyang
    Zhou, Yingwei
    Zhang, Yueyuan
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 2185 - 2190
  • [23] Multivariate prediction model of geothermal parameters based on machine learning
    Zheng, Shuang-Fei
    Li, Xu
    Wang, Meng
    ENERGY, 2025, 316
  • [24] Model Predictive Controller Design For Bioprocesses Based On Machine Learning Algorithms
    Rashedi, Mohammad
    Khodabandehlou, Hamid
    Demers, Matthew
    Wang, Tony
    Garvin, Christopher
    IFAC PAPERSONLINE, 2022, 55 (07): : 45 - 50
  • [25] Machine Learning Based Predictive Model for Risk Assessment of Employee Attrition
    Gabrani, Goldie
    Kwatra, Anshul
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2018, PT IV, 2018, 10963 : 189 - 201
  • [26] Machine learning-based predictive model for prevention of metabolic syndrome
    Shin, Hyunseok
    Shim, Simon
    Oh, Sejong
    PLOS ONE, 2023, 18 (06):
  • [27] Machine Learning predictive model of grapevine yield based on agroclimatic patterns
    Sirsat M.S.
    Mendes-Moreira J.
    Ferreira C.
    Cunha M.
    Engineering in Agriculture, Environment and Food, 2019, 12 (04) : 443 - 450
  • [28] Machine learning-based predictive model for the development of thrombolysis resistance in patients with acute ischemic stroke
    Wang, Xiaorui
    Luo, Song
    Cui, Xue
    Qu, Hongdang
    Zhao, Yujie
    Liao, Qirong
    BMC NEUROLOGY, 2024, 24 (01)
  • [29] IDENTIFICATION OF FACTORS ASSOCIATED WITH DEATH IN DIALYSIS PATIENTS USING A MACHINE LEARNING-BASED PREDICTIVE MODEL
    de Almeida Vicentini, Carolina Aparecida
    Modelli de Andrade, Luis Gustavo
    Ponce, Daniela
    NEPHROLOGY DIALYSIS TRANSPLANTATION, 2023, 38 : I798 - I799
  • [30] FLARES IN PATIENTS WITH RECENT-ONSET PSORIATIC ARTHRITIS. PREDICTIVE MODEL BASED ON MACHINE LEARNING
    Silva, R. Queiro
    Seoane-Mato, D.
    Laiz, A.
    Galindez-Agirregoikoa, E.
    Canete, J. D. D.
    Gratacos-Masmitja, J.
    Juanola-Roura, X.
    Fiter, J.
    Marcos, A. Gonzalez
    ANNALS OF THE RHEUMATIC DISEASES, 2022, 81 : 405 - 406