Prediction of outpatient rehabilitation patient preferences and optimization of graded diagnosis and treatment based on XGBoost machine learning algorithm

被引:0
|
作者
Fan, Xuehui [1 ]
Ye, Ruixue [1 ]
Gao, Yan [1 ]
Xue, Kaiwen [1 ]
Zhang, Zeyu [1 ]
Xu, Jing [1 ]
Zhao, Jingpu [1 ]
Feng, Jun [2 ]
Wang, Yulong [1 ]
机构
[1] Shenzhen Univ, Affiliated Hosp 1, Peoples Hosp Shenzhen 2, Dept Rehabil Med, Shenzhen, Guangdong, Peoples R China
[2] Linping Hosp Integrated Tradit Chinese & Western M, Hangzhou, Zhejiang, Peoples R China
来源
关键词
XGBoost; machine learning algorithm; rehabilitation patient; graded diagnosis and treatment; treatment preferences; HEALTH-CARE; CLASSIFICATION; CHALLENGES; MORTALITY; COVERAGE; CHINA;
D O I
10.3389/frai.2024.1473837
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Background The Department of Rehabilitation Medicine is key to improving patients' quality of life. Driven by chronic diseases and an aging population, there is a need to enhance the efficiency and resource allocation of outpatient facilities. This study aims to analyze the treatment preferences of outpatient rehabilitation patients by using data and a grading tool to establish predictive models. The goal is to improve patient visit efficiency and optimize resource allocation through these predictive models.Methods Data were collected from 38 Chinese institutions, including 4,244 patients visiting outpatient rehabilitation clinics. Data processing was conducted using Python software. The pandas library was used for data cleaning and preprocessing, involving 68 categorical and 12 continuous variables. The steps included handling missing values, data normalization, and encoding conversion. The data were divided into 80% training and 20% test sets using the Scikit-learn library to ensure model independence and prevent overfitting. Performance comparisons among XGBoost, random forest, and logistic regression were conducted using metrics, including accuracy and receiver operating characteristic (ROC) curves. The imbalanced learning library's SMOTE technique was used to address the sample imbalance during model training. The model was optimized using a confusion matrix and feature importance analysis, and partial dependence plots (PDP) were used to analyze the key influencing factors.Results XGBoost achieved the highest overall accuracy of 80.21% with high precision and recall in Category 1. random forest showed a similar overall accuracy. Logistic Regression had a significantly lower accuracy, indicating difficulties with nonlinear data. The key influencing factors identified include distance to medical institutions, arrival time, length of hospital stay, and specific diseases, such as cardiovascular, pulmonary, oncological, and orthopedic conditions. The tiered diagnosis and treatment tool effectively helped doctors assess patients' conditions and recommend suitable medical institutions based on rehabilitation grading.Conclusion This study confirmed that ensemble learning methods, particularly XGBoost, outperform single models in classification tasks involving complex datasets. Addressing class imbalance and enhancing feature engineering can further improve model performance. Understanding patient preferences and the factors influencing medical institution selection can guide healthcare policies to optimize resource allocation, improve service quality, and enhance patient satisfaction. Tiered diagnosis and treatment tools play a crucial role in helping doctors evaluate patient conditions and make informed recommendations for appropriate medical care.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Enhancing the prediction of student performance based on the machine learning XGBoost algorithm
    Asselman, Amal
    Khaldi, Mohamed
    Aammou, Souhaib
    INTERACTIVE LEARNING ENVIRONMENTS, 2023, 31 (06) : 3360 - 3379
  • [2] XGBoost Machine Learning Algorithm for Prediction of Outcome in Aneurysmal Subarachnoid Hemorrhage
    Wang, Ruoran
    Zhang, Jing
    Shan, Baoyin
    He, Min
    Xu, Jianguo
    NEUROPSYCHIATRIC DISEASE AND TREATMENT, 2022, 18 : 659 - 667
  • [3] Improvised_XgBoost Machine learning Algorithm for Customer Churn Prediction
    Swetha P.
    Dayananda R.B.
    EAI Endorsed Transactions on Energy Web, 2020, 7 (30) : 1 - 7
  • [4] Rockburst Prediction Based on Particle Swarm Optimization and Machine Learning Algorithm
    Liu, Yaoru
    Hu, Shaokang
    INFORMATION TECHNOLOGY IN GEO-ENGINEERING, 2020, : 292 - 303
  • [5] Interpretable Learning Algorithm Based on XGBoost for Fault Prediction in Optical Network
    Zhang, Chunyu
    Wang, Danshi
    Song, Chuang
    Wang, Lingling
    Song, Jianan
    Guan, Luyao
    Zhang, Min
    2020 OPTICAL FIBER COMMUNICATIONS CONFERENCE AND EXPOSITION (OFC), 2020,
  • [6] Prediction of Heart Disease Based on Machine Learning Using Jellyfish Optimization Algorithm
    Ahmad, Ahmad Ayid
    Polat, Huseyin
    DIAGNOSTICS, 2023, 13 (14)
  • [7] Prediction and Optimization of Blast Furnace Parameters Based on Machine Learning and Genetic Algorithm
    Li Z.-N.
    Chu M.-S.
    Liu Z.-G.
    Li B.-F.
    Dongbei Daxue Xuebao/Journal of Northeastern University, 2020, 41 (09): : 1262 - 1267
  • [8] Machine-Learning-Based No Show Prediction in Outpatient Visits
    Elvira, C.
    Ochoa, A.
    Gonzalvez, J. C.
    Mochon, F.
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2018, 4 (07): : 29 - 34
  • [9] Prediction of Crops Based on a Machine Learning Algorithm
    SRM Institute of Science and Technology, Department of Networking and Communications, Tamil Nadu, Chennai
    603203, India
    Int. Conf. Comput. Commun. Informatics, ICCCI, 2023,
  • [10] Investment estimation of prefabricated concrete buildings based on XGBoost machine learning algorithm
    Yan, Hongyan
    He, Zheng
    Gao, Ce
    Xie, Mingjing
    Sheng, Haoyu
    Chen, Huihua
    ADVANCED ENGINEERING INFORMATICS, 2022, 54