共 50 条
Development and Validation of Machine Learning Models to Predict Epidermal Growth Factor Receptor Mutation in Non-Small Cell Lung Cancer: A Multi-Center Retrospective Radiomics Study
被引:8
|作者:
Liu, Yafeng
[1
]
Zhou, Jiawei
[1
]
Wu, Jing
[1
,2
]
Wang, Wenyang
[1
]
Wang, Xueqin
[1
]
Guo, Jianqiang
[1
]
Wang, Qingsen
[1
]
Zhang, Xin
[1
]
Li, Danting
[1
]
Xie, Jun
[3
]
Ding, Xuansheng
[1
,4
,5
]
Xing, Yingru
[1
,6
]
Hu, Dong
[1
,2
,3
]
机构:
[1] Anhui Univ Sci & Technol, Sch Med, Huainan, Peoples R China
[2] Anhui Univ Sci & Technol, Anhui Prov Engn Lab Occupat Hlth & Safety, Huainan, Peoples R China
[3] Anhui Univ Sci & Technol, Key Lab Ind Dust Prevent & Control & Occupat Safe, Minist Educ, Huainan, Peoples R China
[4] Anhui Univ Sci & Technol, Canc Hosp, Huainan, Peoples R China
[5] China Pharmaceut Univ, Sch Pharm, Nanjing, Peoples R China
[6] Anhui Zhongke Gengjiu Hosp, Dept Clin Lab, Hefei, Peoples R China
来源:
基金:
中国国家自然科学基金;
关键词:
non-small cell lung cancer;
epidermal growth factor receptor;
computed tomography;
radiomics;
machine learning;
ADENOCARCINOMA;
HETEROGENEITY;
PHENOTYPES;
IMAGES;
D O I:
10.1177/10732748221092926
中图分类号:
R73 [肿瘤学];
学科分类号:
100214 ;
摘要:
Objective To develop and validate a generalized prediction model that can classify epidermal growth factor receptor (EGFR) mutation status in non-small cell lung cancer patients. Methods A total of 346 patients (296 in the training cohort and 50 in the validation cohort) from four centers were included in this retrospective study. First, 1085 features were extracted using IBEX from the computed tomography images. The features were screened using the intraclass correlation coefficient, hypothesis tests and least absolute shrinkage and selection operator. Logistic regression (LR), decision tree (DT), random forest (RF), and support vector machine (SVM) were used to build a radiomics model for classification. The models were evaluated using the following metrics: area under the curve (AUC), calibration curve (CAL), decision curve analysis (DCA), concordance index (C-index), and Brier score. Results Sixteen features were selected, and models were built using LR, DT, RF, and SVM. In the training cohort, the AUCs was .723, .842, .995, and .883; In the validation cohort, the AUCs were .658, 0567, .88, and .765. RF model with the best AUC, its CAL, C-index (training cohort=.998; validation cohort=.883), and Brier score (training cohort=.007; validation cohort=0.137) showed a satisfactory predictive accuracy; DCA indicated that the RF model has better clinical application value. Conclusion Machine learning models based on computed tomography images can be used to evaluate EGFR status in patients with non-small cell lung cancer, and the RF model outperformed LR, DT, and SVM.
引用
收藏
页数:8
相关论文