Deep-Learning Model for Tumor-Type Prediction Using Targeted Clinical Genomic Sequencing Data

被引:0
|
作者
Darmofal, Madison [1 ,2 ]
Suman, Shalabh [3 ]
Atwal, Gurnit [4 ,5 ,6 ]
Toomey, Michael [1 ,2 ]
Chen, Jie-Fu [3 ]
Chang, Jason C. [3 ]
Vakiani, Efsevia [3 ]
Varghese, Anna M. [7 ]
Balakrishnan Rema, Anoop [3 ]
Syed, Aijazuddin [3 ]
Schultz, Nikolaus [8 ,9 ,10 ]
Berger, Michael F. [3 ,8 ,9 ,12 ]
Morris, Quaid [1 ,11 ]
机构
[1] Mem Sloan Kettering Canc Ctr, Sloan Kettering Inst, Computat & Syst Biol Program, New York, NY USA
[2] Weill Cornell Med, Triinst Training Program Computat Biol & Med, New York, NY USA
[3] Mem Sloan Kettering Canc Ctr, Dept Pathol, New York, NY USA
[4] Ontario Inst Canc Res, Computat Biol Program, Toronto, ON, Canada
[5] Univ Toronto, Dept Mol Genet, Toronto, ON, Canada
[6] Vector Inst, Toronto, ON, Canada
[7] Mem Sloan Kettering Canc Ctr, Dept Med, New York, NY USA
[8] Mem Sloan Kettering Canc Ctr, Marie Josee & Henry R Kravis Ctr Mol Oncol, New York, NY USA
[9] Mem Sloan Kettering Canc Ctr, Human Oncol & Pathogenesis Program, New York, NY USA
[10] Mem Sloan Kettering Canc Ctr, Dept Epidemiol & Biostat, New York, NY USA
[11] Sloan Kettering Inst, Computat & Syst Biol Program, 417 East 68th St, New York, NY 10065 USA
[12] Mem Sloan Kettering Canc Ctr, 417 East 68th St, New York, NY 10065 USA
关键词
UNKNOWN PRIMARY; CANCER; MUTATION; ENSEMBLE; ASSAY;
D O I
暂无
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
A deep learning model trained on genomic data from routine clinical sequencing accurately classifies 38 distinct tumor types, enabling guided treatment decisions for patients with cancers of unknown or uncertain origin. Tumor type guides clinical treatment decisions in cancer, but histology-based diagnosis remains challenging. Genomic alterations are highly diagnostic of tumor type, and tumor-type classifiers trained on genomic features have been explored, but the most accurate methods are not clinically feasible, relying on features derived from whole-genome sequencing (WGS), or predicting across limited cancer types. We use genomic features from a data set of 39,787 solid tumors sequenced using a clinically targeted cancer gene panel to develop Genome-Derived-Diagnosis Ensemble (GDD-ENS): a hyperparameter ensemble for classifying tumor type using deep neural networks. GDD-ENS achieves 93% accuracy for high-confidence predictions across 38 cancer types, rivaling the performance of WGS-based methods. GDD-ENS can also guide diagnoses of rare type and cancers of unknown primary and incorporate patient-specific clinical information for improved predictions. Overall, integrating GDD-ENS into prospective clinical sequencing workflows could provide clinically relevant tumor-type predictions to guide treatment decisions in real time.Significance: We describe a highly accurate tumor-type prediction model, designed specifically for clinical implementation. Our model relies only on widely used cancer gene panel sequencing data, predicts across 38 distinct cancer types, and supports integration of patient-specific nongenomic information for enhanced decision support in challenging diagnostic situations. See related commentary by Garg, p. 906. This article is featured in Selected Articles from This Issue, p. 897Significance: We describe a highly accurate tumor-type prediction model, designed specifically for clinical implementation. Our model relies only on widely used cancer gene panel sequencing data, predicts across 38 distinct cancer types, and supports integration of patient-specific nongenomic information for enhanced decision support in challenging diagnostic situations. See related commentary by Garg, p. 906. This article is featured in Selected Articles from This Issue, p. 897Significance: We describe a highly accurate tumor-type prediction model, designed specifically for clinical implementation. Our model relies only on widely used cancer gene panel sequencing data, predicts across 38 distinct cancer types, and supports integration of patient-specific nongenomic information for enhanced decision support in challenging diagnostic situations. See related commentary by Garg, p. 906. This article is featured in Selected Articles from This Issue, p. 897
引用
收藏
页码:1064 / 1081
页数:18
相关论文
共 50 条
  • [1] Classification and deep-learning–based prediction of Alzheimer disease subtypes by using genomic data
    Daichi Shigemizu
    Shintaro Akiyama
    Mutsumi Suganuma
    Motoki Furutani
    Akiko Yamakawa
    Yukiko Nakano
    Kouichi Ozaki
    Shumpei Niida
    [J]. Translational Psychiatry, 13
  • [2] A Deep-Learning Prediction Model for Imbalanced Time Series Data Forecasting
    Hou, Chenyu
    Wu, Jiawei
    Cao, Bin
    Fan, Jing
    [J]. BIG DATA MINING AND ANALYTICS, 2021, 4 (04): : 266 - 278
  • [3] A Deep-Learning Prediction Model for Imbalanced Time Series Data Forecasting
    Chenyu Hou
    Jiawei Wu
    Bin Cao
    Jing Fan
    [J]. Big Data Mining and Analytics, 2021, 4 (04) : 266 - 278
  • [4] A deep-learning model for the prediction of protein domains
    Sato, Renta
    Ekimoto, Toru
    Yoshidome, Takashi
    [J]. BIOPHYSICAL JOURNAL, 2023, 122 (03) : 142A - 142A
  • [5] Deep-learning model for tumor type classification enables enhanced clinical decision support in cancer diagnosis
    Darmofal, Madison
    Suman, Shalabh
    Atwal, Gurnit
    Chen, Jie-Fu
    Varghese, Anna
    Chang, Jason C.
    Rema, Anoop Balakrishnan
    Syed, Aijazuddin
    Morris, Quaid
    Berger, Michael
    [J]. CANCER RESEARCH, 2023, 83 (07)
  • [6] Deep-Learning Model Prediction of Radiation Pneumonitis Using Pretreatment Chest Computed Tomography and Clinical Factors
    Lee, Jang Hyung
    Kang, Min Kyu
    Park, Jongmoo
    Lee, Seoung-Jun
    Kim, Jae-Chul
    Park, Shin-Hyung
    [J]. TECHNOLOGY IN CANCER RESEARCH & TREATMENT, 2024, 23
  • [7] A study on prediction of wind power based on deep-learning using weather data
    Kim E.-J.
    Lee T.-K.
    Kim K.-H.
    [J]. Transactions of the Korean Institute of Electrical Engineers, 2021, 70 (05): : 735 - 741
  • [8] A Multivariate Poisson Deep Learning Model for Genomic Prediction of Count Data
    Montesinos-Lopez, Osval Antonio
    Montesinos-Lopez, Jose Cricelio
    Singh, Pawan
    Lozano-Ramirez, Nerida
    Barron-Lopez, Alberto
    Montesinos-Lopez, Abelardo
    Crossa, Jose
    [J]. G3-GENES GENOMES GENETICS, 2020, 10 (11): : 4177 - 4190
  • [9] Prediction of Short-Time Cloud Motion Using a Deep-Learning Model
    Su, Xinyue
    Li, Tiejian
    An, Chenge
    Wang, Guangqian
    [J]. ATMOSPHERE, 2020, 11 (11)
  • [10] Multilabel Genre Prediction Using Deep-Learning Frameworks
    Unal, Fatima Zehra
    Guzel, Mehmet Serdar
    Bostanci, Erkan
    Acici, Koray
    Asuroglu, Tunc
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (15):