Deep-learning approach to identifying cancer subtypes using high-dimensional genomic data

被引:77
|
作者
Chen, Runpu [1 ]
Yang, Le [1 ]
Goodison, Steve [2 ]
Sun, Yijun [1 ,3 ,4 ]
机构
[1] SUNY Buffalo, Dept Comp Sci & Engn, Buffalo, NY 14214 USA
[2] Mayo Clin, Dept Hlth Sci Res, Jacksonville, FL 32224 USA
[3] SUNY Buffalo, Dept Microbiol & Immunol, Buffalo, NY 14214 USA
[4] SUNY Buffalo, Dept Biostat, Buffalo, NY 14214 USA
关键词
BREAST-CANCER; MODEL; DISCOVERY; CLUSTERS;
D O I
10.1093/bioinformatics/btz769
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Cancer subtype classification has the potential to significantly improve disease prognosis and develop individualized patient management. Existing methods are limited by their ability to handle extremely high-dimensional data and by the influence of misleading, irrelevant factors, resulting in ambiguous and overlapping subtypes. Results: To address the above issues, we proposed a novel approach to disentangling and eliminating irrelevant factors by leveraging the power of deep learning. Specifically, we designed a deep-learning framework, referred to as DeepType, that performs joint supervised classification, unsupervised clustering and dimensionality reduction to learn cancer-relevant data representation with cluster structure. We applied DeepType to the METABRIC breast cancer dataset and compared its performance to state-of-the-art methods. DeepType significantly outperformed the existing methods, identifying more robust subtypes while using fewer genes. The new approach provides a framework for the derivation of more accurate and robust molecular cancer subtypes by using increasingly complex, multi-source data.
引用
收藏
页码:1476 / 1483
页数:8
相关论文
共 50 条
  • [21] Learning high-dimensional data
    Verleysen, M
    LIMITATIONS AND FUTURE TRENDS IN NEURAL COMPUTATION, 2003, 186 : 141 - 162
  • [22] Hybrid deep learning approach to improve classification of low-volume high-dimensional data
    Pegah Mavaie
    Lawrence Holder
    Michael K. Skinner
    BMC Bioinformatics, 24
  • [23] Multivariate Boosting for Integrative Analysis of High-Dimensional Cancer Genomic Data
    Xiong, Lie
    Kuan, Pei-Fen
    Tian, Jianan
    Keles, Sunduz
    Wang, Sijian
    CANCER INFORMATICS, 2014, 13 : 123 - 131
  • [24] Hybrid deep learning approach to improve classification of low-volume high-dimensional data
    Mavaie, Pegah
    Holder, Lawrence
    Skinner, Michael K.
    BMC BIOINFORMATICS, 2023, 24 (01)
  • [25] An enriched approach to combining high-dimensional genomic and low-dimensional phenotypic data
    Cabrera, Javier
    Emir, Birol
    Cheng, Ge
    Duan, Yajie
    Alemayehu, Demissie
    Cherkas, Yauheniya
    JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2024,
  • [26] High-dimensional genomic data bias correction and data integration using MANCIE
    Chongzhi Zang
    Tao Wang
    Ke Deng
    Bo Li
    Sheng’en Hu
    Qian Qin
    Tengfei Xiao
    Shihua Zhang
    Clifford A. Meyer
    Housheng Hansen He
    Myles Brown
    Jun S. Liu
    Yang Xie
    X. Shirley Liu
    Nature Communications, 7
  • [27] High-dimensional genomic data bias correction and data integration using MANCIE
    Zang, Chongzhi
    Wang, Tao
    Deng, Ke
    Li, Bo
    Hu, Sheng'en
    Qin, Qian
    Xiao, Tengfei
    Zhang, Shihua
    Meyer, Clifford A.
    He, Housheng Hansen
    Brown, Myles
    Liu, Jun S.
    Xie, Yang
    Liu, X. Shirley
    NATURE COMMUNICATIONS, 2016, 7
  • [28] Deep learning approach to identifying cancer subtypes using convolutional hyperbolic k nearest neighbours method
    Trpin, Alenka
    Boshkoska, Biljana Mileva
    JOURNAL OF DECISION SYSTEMS, 2024,
  • [29] Functional Modeling of High-Dimensional Data: A Manifold Learning Approach
    Hernandez-Roig, Harold A.
    Aguilera-Morillo, M. Carmen
    Lillo, Rosa E.
    MATHEMATICS, 2021, 9 (04) : 1 - 22
  • [30] A reaction norm model for genomic selection using high-dimensional genomic and environmental data
    Jarquin, Diego
    Crossa, Jose
    Lacaze, Xavier
    Du Cheyron, Philippe
    Daucourt, Joelle
    Lorgeou, Josiane
    Piraux, Francis
    Guerreiro, Laurent
    Perez, Paulino
    Calus, Mario
    Burgueno, Juan
    de los Campos, Gustavo
    THEORETICAL AND APPLIED GENETICS, 2014, 127 (03) : 595 - 607