Clinical Evaluation of Deep Learning for Tumor Delineation on 18F-FDG PET/CT of Head and Neck Cancer

被引:0
|
作者
Kovacs, David G. [1 ,2 ]
Ladefoged, Claes N. [1 ,3 ]
Andersen, Kim F. [1 ]
Brittain, Jane M. [1 ]
Christensen, Charlotte B. [4 ]
Dejanovic, Danijela [1 ]
Hansen, Naja L. [1 ]
Loft, Annika [1 ]
Petersen, Jorgen H. [5 ]
Reichkendler, Michala [1 ]
Andersen, Flemming L. [1 ,2 ]
Fischer, Barbara M. [1 ,2 ,6 ]
机构
[1] Univ Copenhagen, Dept Clin Physiol & Nucl Med, Copenhagen, Denmark
[2] Univ Copenhagen, Fac Hlth & Med Sci, Dept Clin Med, Copenhagen, Denmark
[3] Tech Univ Denmark, Dept Appl Math & Comp Sci, Lyngby, Denmark
[4] Univ Copenhagen, Herlev Hosp, Dept Clin Physiol & Nucl Med, Copenhagen, Denmark
[5] Univ Copenhagen, Inst Publ Hlth, Fac Hlth Sci, Sect Biostat, Copenhagen, Denmark
[6] Kings Coll London, PET Ctr, Sch Biomed Engn & Imaging Sci, London, England
关键词
18 F- FDG PET/CT; head and neck cancer; tumor volume delineation; imaging biomarkers; deep learning; MANAGEMENT; IMPACT;
D O I
10.2967/jnumed.123.266574
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Artificial intelligence (AI) may decrease 18 F - FDG PET/CT-based gross tumor volume (GTV) delineation variability and automate tumorvolume-derived image biomarker extraction. Hence, we aimed to identify and evaluate promising state-of-the-art deep learning methods for head and neck cancer (HNC) PET GTV delineation. Methods: We trained and evaluated deep learning methods using retrospectively included scans of HNC patients referred for radiotherapy between January 2014 and December 2019 (ISRCTN16907234). We used 3 test datasets: an internal set to compare methods, another internal set to compare AI-to-expert variability and expert interobserver variability (IOV), and an external set to compare internal and external AI-to-expert variability. Expert PET GTVs were used as the reference standard. Our benchmark IOV was measured using the PET GTV of 6 experts. The primary outcome was the Dice similarity coefficient (DSC). ANOVA was used to compare methods, a paired t test was used to compare AI-to-expert variability and expert IOV, an unpaired t test was used to compare internal and external AI-toexpert variability, and post hoc Bland-Altman analysis was used to evaluate biomarker agreement. Results: In total, 1,220 18 F - FDG PET/CT scans of 1,190 patients (mean age +/- SD, 63 +/- 10 y; 858 men) were included, and 5 deep learning methods were trained using 5-fold cross-validation ( n = 805). The nnU-Net method achieved the highest similarity (DSC, 0.80 [95% CI, 0.77-0.86]; n = 196). We found no evidence of a difference between expert IOV and AI-to-expert variability (DSC, 0.78 for AI vs. 0.82 for experts; mean difference of 0.04 [95% CI, - 0.01 to 0.09]; P = 0.12; n = 64). We found no evidence of a difference between the internal and external AI-to-expert variability (DSC, 0.80 internally vs. 0.81 externally; mean difference of 0.004 [95% CI, - 0.05 to 0.04]; P = 0.87; n = 125). PET GTV-derived biomarkers of AI were in good agreement with experts. Conclusion: Deep learning can be used to automate 18 F - FDG PET/CT tumorvolume-derived imaging biomarkers, and the deep-learning-based volumes have the potential to assist clinical tumor volume delineation in radiation oncology.
引用
收藏
页码:623 / 629
页数:7
相关论文
共 50 条
  • [31] 18F-FDG PET and PET/CT in the Evaluation of Cancer Treatment Response
    Ben-Haim, Simona
    Ell, Peter
    JOURNAL OF NUCLEAR MEDICINE, 2009, 50 (01) : 88 - 99
  • [32] 18F-FDG PET CT in rare choroidal and mucosal melanomas of the head and neck
    Agrawal, A.
    Purandare, N.
    Shah, S.
    Sonawane, S.
    Rangarajan, V.
    EUROPEAN JOURNAL OF NUCLEAR MEDICINE AND MOLECULAR IMAGING, 2013, 40 : S352 - S352
  • [33] 18F-FDG PET/CT findings of perineural involvement in head and neck malignancies
    Kalkanis, Dimitrios
    Paes, Fabio
    Gomaa, Osama
    Serafini, Aldo
    JOURNAL OF NUCLEAR MEDICINE, 2010, 51
  • [34] Predictive value of pre-therapy 18F-FDG PET/CT for the outcome of 18F-FDG PET-guided radiotherapy in patients with head and neck cancer
    M. Picchio
    M. Kirienko
    P. Mapelli
    I. Dell’Oca
    E. Villa
    F. Gallivanone
    L. Gianolli
    C. Messa
    I. Castiglioni
    European Journal of Nuclear Medicine and Molecular Imaging, 2014, 41 : 21 - 31
  • [35] Predictive value of pre-therapy 18F-FDG PET/CT for the outcome of 18F-FDG PET-guided radiotherapy in patients with head and neck cancer
    Picchio, M.
    Kirienko, M.
    Mapelli, P.
    Dell'Oca, I.
    Villa, E.
    Gallivanone, F.
    Gianolli, L.
    Messa, C.
    Castiglioni, I.
    EUROPEAN JOURNAL OF NUCLEAR MEDICINE AND MOLECULAR IMAGING, 2014, 41 (01) : 21 - 31
  • [36] Perineural Spread in Head and Neck Malignancies: Clinical Significance and Evaluation with 18F-FDG PET/CT1 Response
    Paes, Fabio M.
    RADIOGRAPHICS, 2013, 33 (06) : 1738 - 1738
  • [37] The study of 18F-FDG and 18F-FCH PET/CT in the clinical evaluation of pancreatic cancer
    Cheng, Bing
    Han, Xingmin
    JOURNAL OF NUCLEAR MEDICINE, 2018, 59
  • [38] 18F-FDG PET or PET-CT to evaluate prognosis for head and neck cancer: a meta-analysis
    Xie, Peng
    Li, Minghuan
    Zhao, Hanxi
    Sun, Xindong
    Fu, Zheng
    Yu, Jinming
    JOURNAL OF CANCER RESEARCH AND CLINICAL ONCOLOGY, 2011, 137 (07) : 1085 - 1093
  • [39] 18F-FDG PET or PET-CT to evaluate prognosis for head and neck cancer: a meta-analysis
    Peng Xie
    Minghuan Li
    Hanxi Zhao
    Xindong Sun
    Zheng Fu
    Jinming Yu
    Journal of Cancer Research and Clinical Oncology, 2011, 137 : 1085 - 1093
  • [40] 18F-FDG PET/CT quantification in head and neck squamous cell cancer: principles, technical issues and clinical applications
    Manca, Gianpiero
    Vanzi, Eleonora
    Rubello, Domenico
    Giammarile, Francesco
    Grassetto, Gaia
    Wong, Ka Kit
    Perkins, Alan C.
    Colletti, Patrick M.
    Volterrani, Duccio
    EUROPEAN JOURNAL OF NUCLEAR MEDICINE AND MOLECULAR IMAGING, 2016, 43 (07) : 1360 - 1375