Deep Learning Based Tumor Type Classification Using Gene Expression Data

被引:81
|
作者
Lyu, Boyu [1 ]
Haque, Anamul [1 ]
机构
[1] Virginia Tech, Blacksburg, VA 24061 USA
关键词
Deep Learning; Tumor Type Classification; Pan-Cancer Atlas; Convolutional Neural Network; B-CELL LYMPHOMA; CANCER;
D O I
10.1145/3233547.3233588
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The differential analysis is the most significant part of RNA-Seq analysis. Conventional methods of the differential analysis usually match the tumor samples to the normal samples, which are both from the same tumor type. Such method would fail in differentiating tumor types because it lacks the knowledge from other tumor types. The Pan-Cancer Atlas provides us with abundant information on 33 prevalent tumor types which could be used as prior knowledge to generate tumor-specific biomarkers. In this paper, we embedded the high dimensional RNA-Seq data into 2-D images and used a convolutional neural network to make classification of the 33 tumor types. The final accuracy we got was 95.59%. Furthermore, based on the idea of Guided Grad Cam, as to each class, we generated significance heat-map for all the genes. By doing functional analysis on the genes with high intensities in the heat-maps, we validated that these top genes are related to tumor-specific pathways, and some of them have already been used as biomarkers, which proved the effectiveness of our method. As far as we know, we are the first to apply a convolutional neural network on Pan-Cancer Atlas for the classification of tumor types, and we are also the first to use gene's contribution in classification to the importance of genes to identify candidate biomarkers. Our experiment results show that our method has a good performance and could also apply to other genomics data.
引用
收藏
页码:89 / 96
页数:8
相关论文
共 50 条
  • [21] Deep learning-based classification and interpretation of gene expression data from cancer and normal tissues
    Ahn, TaeJin
    Goo, Taewan
    Lee, Chan-Hee
    Kim, SungMin
    Han, Kyullhee
    Park, Sangick
    Park, Taesung
    [J]. INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2020, 24 (02) : 121 - 139
  • [22] Cell and tumor classification using gene expression data: Construction of forests
    Zhang, HP
    Yu, CY
    Singer, B
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (07) : 4168 - 4172
  • [23] Gene selection for tumor classification using microarray gone expression data
    Yendrapalli, K.
    Basnet, R.
    Mukkamala, S.
    Sung, A. H.
    [J]. WORLD CONGRESS ON ENGINEERING 2007, VOLS 1 AND 2, 2007, : 290 - +
  • [24] BagBoosting for tumor classification with gene expression data
    Dettling, M
    [J]. BIOINFORMATICS, 2004, 20 (18) : 3583 - 3593
  • [25] Tumor Classification Based on Non-Negative Matrix Factorization Using Gene Expression Data
    Zheng, Chun-Hou
    Ng, To-Yee
    Zhang, Lei
    Shiu, Chi-Keung
    Wang, Hong-Qiang
    [J]. IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2011, 10 (02) : 86 - 93
  • [26] Boosting for tumor classification with gene expression data
    Dettling, M
    Bühlmann, P
    [J]. BIOINFORMATICS, 2003, 19 (09) : 1061 - 1069
  • [27] Tumor classification based on gene microarray data and hybrid learning method
    Liu, J
    Zhou, HB
    [J]. 2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 2275 - 2280
  • [28] Optimization Based Tumor Classification from Microarray Gene Expression Data
    Dagliyan, Onur
    Uney-Yuksektepe, Fadime
    Kavakli, I. Halil
    Turkay, Metin
    [J]. PLOS ONE, 2011, 6 (02):
  • [29] Gene expression data classification using topology and machine learning models
    Tamal K. Dey
    Sayan Mandal
    Soham Mukherjee
    [J]. BMC Bioinformatics, 22
  • [30] Cancer Classification of Gene Expression Data using Machine Learning Models
    De Guia, Joseph M.
    Devaraj, Madhavi
    Vea, Larry A.
    [J]. 2018 IEEE 10TH INTERNATIONAL CONFERENCE ON HUMANOID, NANOTECHNOLOGY, INFORMATION TECHNOLOGY, COMMUNICATION AND CONTROL, ENVIRONMENT AND MANAGEMENT (HNICEM), 2018,