Combining handcrafted features with latent variables in machine learning for prediction of radiation-induced lung damage

被引：38

作者：

Cui, Sunan ^{[1
]}

Luo, Yi ^{[2
]}

Tseng, Huan-Hsin ^{[2
]}

Ten Haken, Randall K. ^{[2
]}

El Naga, Issam ^{[2
]}

机构：

[1] Univ Michigan, Appl Phys Program, Ann Arbor, MI 48109 USA

[2] Univ Michigan, Dept Radiat Oncol, Ann Arbor, MI 48109 USA

来源：

MEDICAL PHYSICS | 2019年 / 46卷 / 05期

基金：

美国国家卫生研究院;

关键词：

deep neural networks; feature selection; machine learning; radiotherapy outcome modeling; RADIOTHERAPY OUTCOMES; FEATURE-SELECTION; NEURAL-NETWORK; DOSE-VOLUME; PNEUMONITIS; CANCER; MODEL; IRRADIATION;

D O I：

10.1002/mp.13497

中图分类号：

R8 [特种医学]; R445 [影像诊断学];

学科分类号：

1002 ; 100207 ; 1009 ;

摘要：

Purpose There has been burgeoning interest in applying machine learning methods for predicting radiotherapy outcomes. However, the imbalanced ratio of a large number of variables to a limited sample size in radiation oncology constitutes a major challenge. Therefore, dimensionality reduction methods can be a key to success. The study investigates and contrasts the application of traditional machine learning methods and deep learning approaches for outcome modeling in radiotherapy. In particular, new joint architectures based on variational autoencoder (VAE) for dimensionality reduction are presented and their application is demonstrated for the prediction of lung radiation pneumonitis (RP) from a large-scale heterogeneous dataset. Methods A large-scale heterogeneous dataset containing a pool of 230 variables including clinical factors (e.g., dose, KPS, stage) and biomarkers (e.g., single nucleotide polymorphisms (SNPs), cytokines, and micro-RNAs) in a population of 106 nonsmall cell lung cancer (NSCLC) patients who received radiotherapy was used for modeling RP. Twenty-two patients had grade 2 or higher RP. Four methods were investigated, including feature selection (case A) and feature extraction (case B) with traditional machine learning methods, a VAE-MLP joint architecture (case C) with deep learning and lastly, the combination of feature selection and joint architecture (case D). For feature selection, Random forest (RF), Support Vector Machine (SVM), and multilayer perceptron (MLP) were implemented to select relevant features. Specifically, each method was run for multiple times to rank features within several cross-validated (CV) resampled sets. A collection of ranking lists were then aggregated by top 5% and Kemeny graph methods to identify the final ranking for prediction. A synthetic minority oversampling technique was applied to correct for class imbalance during this process. For deep learning, a VAE-MLP joint architecture where a VAE aimed for dimensionality reduction and an MLP aimed for classification was developed. In this architecture, reconstruction loss and prediction loss were combined into a single loss function to realize simultaneous training and weights were assigned to different classes to mitigate class imbalance. To evaluate the prediction performance and conduct comparisons, the area under receiver operating characteristic curves (AUCs) were performed for nested CVs for both handcrafted feature selections and the deep learning approach. The significance of differences in AUCs was assessed using the DeLong test of U-statistics. Results An MLP-based method using weight pruning (WP) feature selection yielded the best performance among the different hand-crafted feature selection methods (case A), reaching an AUC of 0.804 (95% CI: 0.761-0.823) with 29 top features. A VAE-MLP joint architecture (case C) achieved a comparable but slightly lower AUC of 0.781 (95% CI: 0.737-0.808) with the size of latent dimension being 2. The combination of handcrafted features (case A) and latent representation (case D) achieved a significant AUC improvement of 0.831 (95% CI: 0.805-0.863) with 22 features (P-value = 0.000642 compared with handcrafted features only (Case A) and P-value = 0.000453 compared to VAE alone (Case C)) with an MLP classifier. Conclusion The potential for combination of traditional machine learning methods and deep learning VAE techniques has been demonstrated for dealing with limited datasets in modeling radiotherapy toxicities. Specifically, latent variables from a VAE-MLP joint architecture are able to complement handcrafted features for the prediction of RP and improve prediction over either method alone.(c) 2019 American Association of Physicists in Medicine

引用

页码：2497 / 2511

页数：15

共 50 条

[1] Long term radiological features of radiation-induced lung damage
Veiga, Catarina
Landau, David
McClelland, Jamie R.
Ledermann, Jonathan A.
Hawkes, David
Janes, Sam M.
Devaraj, Anand
[J]. RADIOTHERAPY AND ONCOLOGY, 2018, 126 (02) : 300 - 306
[2] Towards a better prediction of radiation-induced lung damage (RILD)
Houben, A.
Aerts, H.
Bosmans, G.
Duisters, C.
Emans, D.
Lambin, P.
Dekker, A.
De Ruysscher, D.
[J]. RADIOTHERAPY AND ONCOLOGY, 2007, 84 : S160 - S160
[3] Radiomics analysis on CT images for prediction of radiation-induced kidney damage by machine learning models
Amiri, Sepideh
Akbarabadi, Mina
Abdolali, Fatemeh
Nikoofar, Alireza
Esfahani, Azam Janati
Cheraghi, Susan
[J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 133
[4] Radiation-induced depassivation of latent plasma damage
Cellere, G
Paccagnella, A
Pantisano, L
Valentini, MG
Flament, O
Mousseau, O
Fuochi, PG
[J]. MICROELECTRONIC ENGINEERING, 2002, 60 (3-4) : 439 - 450
[5] THE PATHOGENESIS OF RADIATION-INDUCED LUNG DAMAGE
GROSS, NJ
[J]. LUNG, 1981, 159 (03) : 115 - 125
[6] The limitations of dosimetric parameters for the prediction of radiation-induced lung toxicity: An approach based on machine learning techniques
Dehign-Oberije, C.
Fung, G.
De Ruysscher, D.
van der Weide, H.
Krishna, S.
Ra, R. B.
Lambin, P.
[J]. INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2007, 69 (03): : S488 - S489
[7] Robust Normal Lung CT Texture Features for the Prediction of Radiation-Induced Lung Disease
Choi, W.
Riyahi, S.
Liu, C. J.
Lu, W.
[J]. INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2017, 99 (02): : S196 - S197
[8] Lung Nodule Malignancy Prediction by Combining Handcrafted Features and Deep Convolutional Neural Network
Li, S.
Chen, L.
Zhou, Z.
Hao, H.
Duan, Y.
Li, B.
Folkert, M.
Jiang, S.
Wang, J.
[J]. MEDICAL PHYSICS, 2018, 45 (06) : E668 - E669
[9] PREDICTION OF RADIATION-INDUCED LUNG DAMAGE USING BIOLOGY-BASED MODELS
van Luijk, P.
Muijs, C.
Faber, H.
Schippers, M.
Brandenburg, S.
Langendijk, J. A.
Coppes, R.
[J]. RADIOTHERAPY AND ONCOLOGY, 2009, 92 : S244 - S244
[10] Prediction of lung radiation-induced pneumonitis using the support vector machine algorithm
Chen, S.
Zhou, S.
Zhang, J.
Marks, L.
Das, S.
[J]. MEDICAL PHYSICS, 2007, 34 (06) : 2602 - 2603

← 1 2 3 4 5 →