Explainable Model Using Shapley Additive Explanations Approach on Wound Infection after Wide Soft Tissue Sarcoma Resection: "Big Data" Analysis Based on Health Insurance Review and Assessment Service Hub

Cited by: 0
Authors
Choi, Ji-Hye [1, 2]
Choi, Yumin [3]
Lee, Kwang-Sig [4]
Ahn, Ki-Hoon [2, 5]
Jang, Woo Young [1, 2]
Affiliations
[1] Korea Univ, Anam Hosp, Coll Med, Dept Orthoped Surg, 73 Goryeodae Ro, Seoul 02841, South Korea
[2] Korea Univ, Anam Hosp, Bloodless Med Ctr, Coll Med, Seoul 02841, South Korea
[3] Korea Univ, Coll Med, Sch Mech Engn, 73 Goryeodae Ro, Seoul 02841, South Korea
[4] Korea Univ, Anam Hosp, Coll Med, AI Ctr, 73 Goryeodae Ro, Seoul 02841, South Korea
[5] Korea Univ, Anam Hosp, Coll Med, Dept Obstet & Gynecol, Seoul 02841, South Korea
Source
MEDICINA-LITHUANIA | 2024 / Vol. 60 / Issue 02
Keywords
SHAP; Explainable AI; big data; soft tissue sarcoma; perioperative transfusion; POSTOPERATIVE INFECTIONS; RISK;
DOI
10.3390/medicina60020327
Chinese Library Classification number
R5 [Internal Medicine];
Discipline classification code
1002 ; 100201 ;
Abstract
Background and Objectives: Soft tissue sarcomas are a heterogeneous group of malignant tumors of mesenchymal origin. Despite their low prevalence, they present clinical challenges for orthopedic surgeons owing to their aggressive nature and the risk of perioperative wound infection. However, this low prevalence has hindered large-scale studies. This study aimed to analyze wound infections after wide resection in patients with soft tissue sarcomas using big data analytics from the Hub of the Health Insurance Review and Assessment Service (HIRA).
Materials and Methods: Patients who underwent wide excision of soft tissue sarcomas between 2010 and 2021 were included. Data were collected from the HIRA database, which holds information on approximately 50 million individuals in the Republic of Korea, and included demographic information, diagnoses, prescribed medications, and surgical procedures. A random forest was used to analyze the major determinants of infection. A total of 10,906 observations with complete data were divided into training and validation sets in an 80:20 ratio (8773 vs. 2193 cases). Random forest permutation importance was employed to identify the major predictors of infection, and Shapley Additive Explanations (SHAP) values were derived to analyze the directions of the associations between the predictors and infection.
Results: A total of 10,969 patients who underwent wide excision of soft tissue sarcomas were included. Of these, 886 (8.08%) had postoperative infections requiring surgery. The overall transfusion rate for wide excision was 20.67% (2267 patients). Risk factors for wound infection were analyzed among patients' comorbidities, and dependence plots of individual features were visualized. The transfusion dependence plot showed a distinctive pattern: SHAP values trended negative for patients without blood transfusions and positive for those who received them, indicating a substantial impact of blood transfusion on the likelihood of wound infection.
Conclusions: Based on the random forest model and SHAP values, perioperative transfusion, male sex, old age, and low socioeconomic status (SES) were important features associated with wound infection in patients with soft tissue sarcoma.
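The sign pattern reported for transfusion (negative SHAP values without transfusion, positive with it) can be illustrated with a small sketch that computes exact Shapley values from the standard subset-weighted formula. This is not the authors' model or data: `toy_risk`, `WEIGHTS`, `BASE_RISK`, and the `background` rows are hypothetical stand-ins for a fitted random forest and the HIRA cohort, chosen only to show how a feature's contribution changes sign relative to a background average.

```python
from itertools import combinations
from math import factorial

FEATURES = ["transfusion", "male", "older_age", "low_ses"]

# Hypothetical increments for a toy risk score; NOT estimates from the study.
WEIGHTS = {"transfusion": 0.10, "male": 0.03, "older_age": 0.02, "low_ses": 0.01}
BASE_RISK = 0.05  # hypothetical baseline risk


def toy_risk(x):
    """Toy stand-in for a fitted model: baseline plus one increment per present feature."""
    return BASE_RISK + sum(WEIGHTS[f] * x[f] for f in FEATURES)


def coalition_value(model, x, background, coalition):
    """Mean model output when features in `coalition` come from x and the rest from background rows."""
    total = 0.0
    for row in background:
        mixed = {f: (x[f] if f in coalition else row[f]) for f in FEATURES}
        total += model(mixed)
    return total / len(background)


def shapley_values(model, x, background):
    """Exact Shapley values by enumerating every subset of the other features."""
    n = len(FEATURES)
    phi = {}
    for f in FEATURES:
        others = [g for g in FEATURES if g != f]
        contribution = 0.0
        for k in range(n):
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for subset in combinations(others, k):
                with_f = coalition_value(model, x, background, set(subset) | {f})
                without_f = coalition_value(model, x, background, set(subset))
                contribution += weight * (with_f - without_f)
        phi[f] = contribution
    return phi


# Hypothetical background sample (1 of 5 rows transfused); not HIRA data.
background = [
    {"transfusion": 1, "male": 1, "older_age": 0, "low_ses": 0},
    {"transfusion": 0, "male": 0, "older_age": 1, "low_ses": 0},
    {"transfusion": 0, "male": 1, "older_age": 0, "low_ses": 1},
    {"transfusion": 0, "male": 0, "older_age": 1, "low_ses": 0},
    {"transfusion": 0, "male": 1, "older_age": 0, "low_ses": 0},
]

transfused = {"transfusion": 1, "male": 1, "older_age": 1, "low_ses": 0}
not_transfused = {"transfusion": 0, "male": 1, "older_age": 1, "low_ses": 0}

phi_yes = shapley_values(toy_risk, transfused, background)
phi_no = shapley_values(toy_risk, not_transfused, background)
print("transfusion contribution, transfused patient:", phi_yes["transfusion"])
print("transfusion contribution, non-transfused patient:", phi_no["transfusion"])
```

Exhaustive enumeration is practical only for a handful of features. For a real random forest, a tree-specific SHAP implementation would normally be used instead, and the signs and magnitudes would depend on the fitted model rather than on fixed weights like these.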
Pages: 11