Machine learning-driven prediction of hospital admissions using gradient boosting and GPT-2

被引:0
|
作者
Zhang, Xingyu [1 ]
Wang, Hairong [3 ]
Yu, Guan [4 ]
Zhang, Wenbin [2 ]
机构
[1] Univ Pittsburgh, Sch Hlth & Rehabil Sci, Dept Commun Sci & Disorders, Pittsburgh, PA USA
[2] Florida Int Univ, Knight Fdn Sch Comp & Informat Sci, Miami, FL USA
[3] Georgia Inst Technol, Coll Comp, Atlanta, GA USA
[4] Univ Pittsburgh, Sch Publ Hlth, Dept Biostat & Hlth Data Sci, Pittsburgh, PA USA
来源
DIGITAL HEALTH | 2025年 / 11卷
关键词
Hospital admission prediction; emergency department; machine learning; natural language processing; TRIAGE;
D O I
10.1177/20552076251331319
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Accurately predicting hospital admissions from the emergency department (ED) is essential for improving patient care and resource allocation. This study aimed to predict hospital admissions by integrating both structured clinical data and unstructured text data using machine learning models. Methods: Data were obtained from the 2021 National Hospital Ambulatory Medical Care Survey-Emergency Department (NHAMCS-ED), including adult patients aged 18 years and older. Structured data included demographics, visit characteristics, vital signs, and medical history, while unstructured data consisted of free-text chief complaints and injury descriptions. A Gradient Boosting Classifier (GBC) was applied to structured data, while a fine-tuned GPT-2 model processed the unstructured text. A combined model was created by averaging the outputs of both models. Model performance was evaluated using 5-fold cross-validation, assessing accuracy, precision, sensitivity, specificity, and area under the receiver operating characteristic curve (AUC-ROC). Results: Among the 13,115 patients, 2264 (17.3%) were admitted to the hospital. The combined model outperformed the individual structured and unstructured models, achieving an accuracy of 75.8%, precision of 39.5%, sensitivity of 75.8%, and specificity of 75.8%. In comparison, the structured data model achieved 73.8% accuracy, while the unstructured model reached 64.6%. The combined model had the highest AUC, indicating superior performance. Conclusions: Combining structured and unstructured data using machine learning significantly improves the prediction of hospital admissions from the ED. This integrated approach can enhance decision-making and optimize ED operations.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Machine learning-driven structure prediction for iron hydrides
    Tahmasbi, Hossein
    Ramakrishna, Kushal
    Lokamani, Mani
    Cangi, Attila
    PHYSICAL REVIEW MATERIALS, 2024, 8 (03):
  • [2] Machine Learning-Driven Methods for Nanobody Affinity Prediction
    Feng, Hua
    Sun, Xuefeng
    Li, Ning
    Xu, Qian
    Li, Qin
    Zhang, Shenli
    Xing, Guangxu
    Zhang, Gaiping
    Wang, Fangyu
    ACS OMEGA, 2024, 9 (48): : 47893 - 47902
  • [3] Machine Learning-Driven Prediction of Peritoneal Dialysis Technique Failure
    Monaghan, Caitlin
    Willetts, Joanna
    Han, Hao
    Kraus, Michael A.
    Chatoth, Dinesh K.
    Usvyat, Len A.
    Maddux, Franklin W.
    JOURNAL OF THE AMERICAN SOCIETY OF NEPHROLOGY, 2021, 32 (10): : 329 - 329
  • [4] Machine learning-driven insights into phase prediction for high entropy alloys
    Jain, Reliance
    Jain, Sandeep
    Dewangan, Sheetal Kumar
    Boriwal, Lokesh Kumar
    Samal, Sumanta
    Journal of Alloys and Metallurgical Systems, 2024, 8
  • [5] Machine learning-driven catalyst design, synthesis and performance prediction for CO2 hydrogenation
    Asif, Muhammad
    Yao, Chengxi
    Zuo, Zitu
    Bilal, Muhammad
    Zeb, Hassan
    Lee, Seungjae
    Wang, Ziyang
    Kim, Taesung
    JOURNAL OF INDUSTRIAL AND ENGINEERING CHEMISTRY, 2025, 144 : 32 - 47
  • [6] Boosting Convergence of Timing Closure using Feature Selection in a Learning-driven Approach
    Que Yanghua
    Ng, Harnhua
    Kapre, Nachiket
    2016 26TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2016,
  • [7] Machine Learning-Driven Scattering Efficiency Prediction in Passive Daytime Radiative Cooling
    Shi, Changmin
    Zheng, Jiayu
    Wang, Ying
    Gan, Chenjie
    Zhang, Liwen
    Sheldon, Brian W.
    ATMOSPHERE, 2025, 16 (01)
  • [8] Deep learning versus gradient boosting machine for pan evaporation prediction
    Malik, Anurag
    Saggi, Mandeep Kaur
    Rehman, Sufia
    Sajjad, Haroon
    Inyurt, Samed
    Bhatia, Amandeep Singh
    Farooque, Aitazaz Ahsan
    Oudah, Atheer Y.
    Yaseen, Zaher Mundher
    ENGINEERING APPLICATIONS OF COMPUTATIONAL FLUID MECHANICS, 2022, 16 (01) : 570 - 587
  • [9] Machine Learning-Driven Trust Prediction for MEC-based IoT Services
    Abeysekara, Prabath
    Dong, Hai
    Qin, A. K.
    2019 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES (IEEE ICWS 2019), 2019, : 188 - 192
  • [10] An Improved Machine Learning-Driven Framework for Cryptocurrencies Price Prediction With Sentimental Cautioning
    Zubair, Muhammad
    Ali, Jaffar
    Alhussein, Musaed
    Hassan, Shoaib
    Aurangzeb, Khursheed
    Umair, Muhammad
    IEEE ACCESS, 2024, 12 : 51395 - 51418