Unsupervised deep learning reveals prognostically relevant subtypes of glioblastoma

Cited by: 44
Authors
Young, Jonathan D. [1, 2]
Cai, Chunhui [1, 3]
Lu, Xinghua [1, 3]
Affiliations
[1] Univ Pittsburgh, Dept Biomed Informat, 5607 Baum Blvd, Pittsburgh, PA 15206 USA
[2] Univ Pittsburgh, Intelligent Syst Program, 5607 Baum Blvd, Pittsburgh, PA 15206 USA
[3] Univ Pittsburgh, Ctr Causal Discovery, 5607 Baum Blvd, Pittsburgh, PA 15206 USA
Source
BMC BIOINFORMATICS | 2017 / Vol. 18
Keywords
Deep learning; Unsupervised learning; Cancer; Glioblastoma multiforme; Deep belief network; Gene expression; Model selection
DOI
10.1186/s12859-017-1798-2
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: One approach to improving the personalized treatment of cancer is to understand, at the level of the individual patient, the cellular signal transduction pathways that cause cancer. In this study, we used unsupervised deep learning to learn the hierarchical structure within cancer gene expression data. Deep learning is a group of machine learning algorithms that use multiple layers of hidden units to capture hierarchically related, alternative representations of the input data. We hypothesize that the hierarchical structure learned by deep learning will be related to the cellular signaling system.
Results: Robust deep learning model selection identified a network architecture that is biologically plausible. Our model selection results indicated that the 1st hidden layer of our deep learning model should contain about 1300 hidden units to most effectively capture the covariance structure of the input data. This agrees with the estimated number of human transcription factors, which is approximately 1400. This result lends support to our hypothesis that the 1st hidden layer of a deep learning model trained on gene expression data may represent signals related to transcription factor activation. Using the 3rd hidden layer representation of each tumor as learned by our unsupervised deep learning model, we performed consensus clustering on all tumor samples, which led to the discovery of clusters of glioblastoma multiforme with differential survival. One of these clusters contained all of the glioblastoma samples with G-CIMP, a known methylation phenotype driven by the IDH1 mutation and associated with favorable prognosis. This suggests that the hidden units in the 3rd hidden layer representations captured a methylation signal without explicitly using methylation data as input. We also found differentially expressed genes and well-known mutations (NF1, IDH1, EGFR) that were uniquely correlated with each of these clusters. Exploring these unique genes and mutations will allow us to further investigate the disease mechanisms underlying each of these clusters.
Conclusions: In summary, we show that a deep learning model can be trained to represent biologically and clinically meaningful abstractions of cancer gene expression data. Understanding what additional relationships these hidden layer abstractions have with the cancer cellular signaling system could have a significant impact on the understanding and treatment of cancer.
Pages: 13
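The abstract mentions consensus clustering of per-tumor hidden-layer representations but gives no algorithmic detail. The following is a minimal sketch of the general idea, not the authors' implementation. It assumes k-means as the base clusterer and repeated random subsampling, and all names (`kmeans`, `consensus_matrix`, `toy_hidden`), the parameter values, and the toy vectors are hypothetical stand-ins for real 3rd-hidden-layer activations.

```python
import random


def kmeans(points, k, rng, iters=20):
    """Plain Lloyd's k-means; returns one cluster label per point."""
    centers = [list(p) for p in rng.sample(points, k)]
    labels = [0] * len(points)
    for _ in range(iters):
        # Assign each point to its nearest center (squared Euclidean distance).
        labels = [
            min(range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centers[c])))
            for p in points
        ]
        # Move each center to the mean of its members; empty clusters stay put.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def consensus_matrix(points, k, n_runs=50, frac=0.8, seed=0):
    """Fraction of subsampled runs in which each pair of samples co-clusters."""
    rng = random.Random(seed)
    n = len(points)
    together = [[0] * n for _ in range(n)]  # times i and j shared a cluster
    sampled = [[0] * n for _ in range(n)]   # times i and j were drawn together
    for _ in range(n_runs):
        idx = rng.sample(range(n), max(k, int(frac * n)))
        labels = kmeans([points[i] for i in idx], k, rng)
        for a, i in enumerate(idx):
            for b, j in enumerate(idx):
                sampled[i][j] += 1
                if labels[a] == labels[b]:
                    together[i][j] += 1
    return [
        [together[i][j] / sampled[i][j] if sampled[i][j] else 0.0
         for j in range(n)]
        for i in range(n)
    ]


# Hypothetical stand-in for hidden-layer vectors: two well-separated groups.
toy_hidden = [
    [0.0, 0.0], [0.3, 0.1], [0.1, 0.2],
    [5.0, 1.0], [5.3, 1.1], [5.1, 0.8],
]
cm = consensus_matrix(toy_hidden, k=2)
```

Entries of `cm` near 1 mark pairs of samples that are grouped together in almost every run, and entries near 0 mark pairs that are almost never grouped together. In a typical consensus-clustering workflow, this matrix is then clustered once more to obtain the final, more stable groups.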