Deep Learning Based Tumor Type Classification Using Gene Expression Data

被引:81
|
作者
Lyu, Boyu [1 ]
Haque, Anamul [1 ]
机构
[1] Virginia Tech, Blacksburg, VA 24061 USA
关键词
Deep Learning; Tumor Type Classification; Pan-Cancer Atlas; Convolutional Neural Network; B-CELL LYMPHOMA; CANCER;
D O I
10.1145/3233547.3233588
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The differential analysis is the most significant part of RNA-Seq analysis. Conventional methods of the differential analysis usually match the tumor samples to the normal samples, which are both from the same tumor type. Such method would fail in differentiating tumor types because it lacks the knowledge from other tumor types. The Pan-Cancer Atlas provides us with abundant information on 33 prevalent tumor types which could be used as prior knowledge to generate tumor-specific biomarkers. In this paper, we embedded the high dimensional RNA-Seq data into 2-D images and used a convolutional neural network to make classification of the 33 tumor types. The final accuracy we got was 95.59%. Furthermore, based on the idea of Guided Grad Cam, as to each class, we generated significance heat-map for all the genes. By doing functional analysis on the genes with high intensities in the heat-maps, we validated that these top genes are related to tumor-specific pathways, and some of them have already been used as biomarkers, which proved the effectiveness of our method. As far as we know, we are the first to apply a convolutional neural network on Pan-Cancer Atlas for the classification of tumor types, and we are also the first to use gene's contribution in classification to the importance of genes to identify candidate biomarkers. Our experiment results show that our method has a good performance and could also apply to other genomics data.
引用
收藏
页码:89 / 96
页数:8
相关论文
共 50 条
  • [31] Gene expression data classification using topology and machine learning models
    Dey, Tamal K.
    Mandal, Sayan
    Mukherjee, Soham
    [J]. BMC BIOINFORMATICS, 2022, 22 (SUPPL 10)
  • [32] Feasibility Study of Deep Learning Based Radiosensitivity Binary Classification Model Using Gene Expression Profiling
    Kim, E.
    Chung, Y.
    [J]. MEDICAL PHYSICS, 2021, 48 (06)
  • [33] Deep Learning-based Identification of Cancer or Normal Tissue using Gene Expression Data
    Ahn, TaeJin
    Goo, Taewan
    Lee, Chan-hee
    Kim, SungMin
    Han, Kyullhee
    Park, Sangick
    Park, Taesung
    [J]. PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 1748 - 1752
  • [34] Eigengene-based linear discriminant model for tumor classification using gene expression microarray data
    Shen, Ronglai
    Ghosh, Debashis
    Chinnaiyan, Arul
    Meng, Zhaoling
    [J]. BIOINFORMATICS, 2006, 22 (21) : 2635 - 2642
  • [35] A Review on Recent Progress in Machine Learning and Deep Learning Methods for Cancer Classification on Gene Expression Data
    Mazlan, Aina Umairah
    Sahabudin, Noor Azida
    Remli, Muhammad Akmal
    Ismail, Nor Syahidatul Nadiah
    Mohamad, Mohd Saberi
    Nies, Hui Wen
    Abd Warif, Nor Bakiah
    [J]. PROCESSES, 2021, 9 (08)
  • [36] Using fuzzy kernel discriminant analysis for tumor classification with gene expression data
    Zhou, Xiaoyan
    Zheng, Wenming
    [J]. Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2007, 35 (SUPPL. 1): : 173 - 176
  • [37] A supervised orthogonal discriminant projection for tumor classification using gene expression data
    Zhang, Chuanlei
    Zhang, Shanwen
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2013, 43 (05) : 568 - 575
  • [38] Tumor classification by partial least squares using microarray gene expression data
    Nguyen, DV
    Rocke, DM
    [J]. BIOINFORMATICS, 2002, 18 (01) : 39 - 50
  • [39] Effective dimension reduction methods for tumor classification using gene expression data
    Antoniadis, A
    Lambert-Lacroix, S
    Leblanc, F
    [J]. BIOINFORMATICS, 2003, 19 (05) : 563 - 570
  • [40] A Discriminative Feature Extraction Approach for Tumor Classification Using Gene Expression Data
    Mei, Qinglin
    Zhang, Huaxiang
    Liang, Cheng
    [J]. CURRENT BIOINFORMATICS, 2016, 11 (05) : 561 - 570