Development and validation of machine learning models for predicting venous thromboembolism in colorectal cancer patients: A cohort study in China

被引:0
|
作者
机构
[1] Hu, Zuhai
[2] Li, Xiaosheng
[3] Yuan, Yuliang
[4] Xu, Qianjie
[5] Zhang, Wei
[6] Lei, Haike
关键词
Contrastive Learning - Diseases - Prediction models - Risk assessment - Self-supervised learning;
D O I
10.1016/j.ijmedinf.2024.105770
中图分类号
学科分类号
摘要
Background: With advancements in healthcare, traditional VTE risk assessment tools are increasingly insufficient to meet the demands of high-quality care, underscoring the need for innovative and specialized assessment methods. Objective: Owing to the remarkable success of machine learning in supervised learning and disease prediction, our objective is to develop a reliable and efficient model for assessing VTE risk by leveraging the fundamental data and clinical characteristics of colorectal cancer patients within our medical facility. Methods: Six commonly used machine learning algorithms were utilized in our study to predict the occurrence of VTE in patients with rectal cancer. In the modeling process, LASSO regression was employed to identify and exclude variables not associated with VTE. Additionally, hyperparameter tuning was conducted via 5-fold cross-validation to mitigate overfitting, and 200 bootstrap samples were used to adjust the apparent performance on the training set. The selection of the VTE assessment model was determined by a thorough evaluation of performance criteria, such as the AUC, ACC and F1 score. Results: The RF model exhibits consistent and efficient performance. Specifically, in the internally validation dataset, where generalizability was adjusted, the RF model achieved the highest scores across multiple metrics: AD-AUC (0.895), AD-ACC (0.871), AD-F1 (0.311), AD-MCC (0.316), AD-Precision (0.241), AD-Specificity (0.888). For external validation on unseen colon cancer data, the RF model also performed best in terms of ACC (0.728), F1 (0.292), MCC (0.225), Precision (0.192), and Specificity (0.740), with a suboptimal AUC of 0.745 and a Sensitivity (Recall) of 0.615. Additionally, the RF model demonstrates strong performance not only on the original dataset but also on datasets processed via alternative imbalance handling techniques. Conclusions: Our research successfully established and validated a risk assessment model for assessing the risk of VTE in colorectal cancer patients. © 2024 Elsevier B.V.
引用
收藏
相关论文
共 50 条
  • [1] Development and validation of machine learning models for postoperative venous thromboembolism prediction in colorectal cancer inpatients: a retrospective study br
    Qin, Li
    Liang, Zhikun
    Xie, Jingwen
    Ye, Guozeng
    Guan, Pengcheng
    Huang, Yaoyao
    Li, Xiaoyan
    JOURNAL OF GASTROINTESTINAL ONCOLOGY, 2023, 14 (01) : 220 - +
  • [2] Development and validation of machine learning models for venous thromboembolism risk assessment at admission: a retrospective study
    Sheng, Wenbo
    Wang, Xiaoli
    Xu, Wenxiang
    Hao, Zedong
    Ma, Handong
    Zhang, Shaodian
    FRONTIERS IN CARDIOVASCULAR MEDICINE, 2023, 10
  • [3] Development and validation of a nomogram model for predicting venous thromboembolism risk in lung cancer patients treated with immune checkpoint inhibitors: A cohort study in China
    Liang, Guanzhong
    Hu, Zuhai
    Xu, Qianjie
    Wang, Guixue
    Wang, Ying
    Li, Xiaosheng
    Zhang, Wei
    Lei, Haike
    CANCER MEDICINE, 2024, 13 (16):
  • [4] Development and validation of a machine learning model to predict venous thromboembolism among hospitalized cancer patients
    Meng, Lingqi
    Wei, Tao
    Fan, Rongrong
    Su, Haoze
    Liu, Jiahui
    Wang, Lijie
    Huang, Xinjuan
    Qi, Yi
    Li, Xuying
    ASIA-PACIFIC JOURNAL OF ONCOLOGY NURSING, 2022, 9 (12)
  • [5] Predicting Venous Thromboembolism Recurrence Risk in Patients with Cancer: A Validation Study
    Louzada, Martha L.
    Bose, Gauruv
    Cheung, Andrew
    Chin-Yee, Benjamin H.
    Wells, Simon
    Wells, Philip S.
    BLOOD, 2012, 120 (21)
  • [6] Predicting venous thromboembolism (VTE) risk in cancer patients using machine learning
    Townsley, Samir Khan
    Basu, Debraj
    Vora, Jayneel
    Wun, Ted
    Chuah, Chen-Nee
    Shankar, Prabhu R. V.
    HEALTH CARE SCIENCE, 2023, 2 (04): : 205 - 222
  • [7] Development and Validation of a Risk Prediction Model for Venous Thromboembolism in Lung Cancer Patients Using Machine Learning
    Lei, Haike
    Zhang, Mengyang
    Wu, Zeyi
    Liu, Chun
    Li, Xiaosheng
    Zhou, Wei
    Long, Bo
    Ma, Jiayang
    Zhang, Huiyi
    Wang, Ying
    Wang, Guixue
    Gong, Mengchun
    Hong, Na
    Liu, Haixia
    Wu, Yongzhong
    FRONTIERS IN CARDIOVASCULAR MEDICINE, 2022, 9
  • [8] Development and validation of a nomogram model for predicting the risk of venous thromboembolism in lymphoma patients undergoing chemotherapy: a prospective cohort study conducted in China
    Liang, Guanzhong
    Li, Xiaosheng
    Xu, Qianjie
    Yang, Zailin
    Li, Jieping
    Yang, Tao
    Wang, Guixue
    Lei, Haike
    ANNALS OF MEDICINE, 2023, 55 (02)
  • [9] Predicting the risk of recurrent venous thromboembolism in patients with cancer: A prospective cohort study
    van Es, Nick
    Louzada, Martha
    Carrier, Marc
    Tagalakis, Vicky
    Gross, Peter L.
    Shivakumar, Sudeep
    Rodger, Marc A.
    Wells, Philip S.
    THROMBOSIS RESEARCH, 2018, 163 : 41 - 46
  • [10] Development and Validation of a Machine Learning Model for Predicting the Prognosis of Older Patients with Gastric Cancer: A Retrospective, Population-Based Cohort Study In China
    Huang, Ze-Ning
    Zhang, Xing-Qi
    Wu, Ju
    Zheng, Chang-Yue
    Xie, Rong-Zhen
    Wu, Ying-Xin
    Liu, Xiao-Dong
    Chen, Qi-Yue
    Xie, Jian-Wei
    Li, Ping
    Zheng, Chao-Hui
    Lin, Jian-Xian
    Zhou, Yanbing
    Huang, Chang-Ming
    SSRN,