Occlusion enhanced pan-cancer classification via deep learning

被引:0
|
作者
Zhao, Xing [1 ,2 ]
Chen, Zigui [3 ]
Wang, Huating [4 ]
Sun, Hao [2 ,5 ]
机构
[1] Chinese Univ Hong Kong, Dept Orthopaed & Traumatol, Hong Kong, Peoples R China
[2] Chinese Univ Hong Kong, Warshel Inst Computat Biol, Shenzhen, Guangdong, Peoples R China
[3] Chinese Univ Hong Kong, Dept Microbiol, Hong Kong, Peoples R China
[4] Chinese Univ Hong Kong, Li Ka Shing Inst Hlth Sci, Dept Orthopaed & Traumatol, Hong Kong, Peoples R China
[5] Chinese Univ Hong Kong, Li Ka Shing Inst Hlth Sci, Dept Chem Pathol, Hong Kong, Peoples R China
来源
BMC BIOINFORMATICS | 2024年 / 25卷 / 01期
关键词
Pan-cancer classification; Marker gene identification; Deep neural network; Long short term memory; Occlusion; HUMAN PROTEIN ATLAS; CELL-PROLIFERATION; COLORECTAL-CANCER; EXPRESSION; PSEUDOGENE; PROGNOSIS; PROGRESSION; PROMOTES; SAMPLES; P53;
D O I
10.1186/s12859-024-05870-y
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Quantitative measurement of RNA expression levels through RNA-Seq is an ideal replacement for conventional cancer diagnosis via microscope examination. Currently, cancer-related RNA-Seq studies focus on two aspects: classifying the status and tissue of origin of a sample and discovering marker genes. Existing studies typically identify marker genes by statistically comparing healthy and cancer samples. However, this approach overlooks marker genes with low expression level differences and may be influenced by experimental results. This paper introduces "GENESO," a novel framework for pan-cancer classification and marker gene discovery using the occlusion method in conjunction with deep learning. we first trained a baseline deep LSTM neural network capable of distinguishing the origins and statuses of samples utilizing RNA-Seq data. Then, we propose a novel marker gene discovery method called "Symmetrical Occlusion (SO)". It collaborates with the baseline LSTM network, mimicking the "gain of function" and "loss of function" of genes to evaluate their importance in pan-cancer classification quantitatively. By identifying the genes of utmost importance, we then isolate them to train new neural networks, resulting in higher-performance LSTM models that utilize only a reduced set of highly relevant genes. The baseline neural network achieves an impressive validation accuracy of 96.59% in pan-cancer classification. With the help of SO, the accuracy of the second network reaches 98.30%, while using 67% fewer genes. Notably, our method excels in identifying marker genes that are not differentially expressed. Moreover, we assessed the feasibility of our method using single-cell RNA-Seq data, employing known marker genes as a validation test.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] Genomic pan-cancer classification using image-based deep learning
    Ye, Taoyu
    Li, Sen
    Zhang, Yang
    [J]. COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2021, 19 : 835 - 846
  • [2] Genomic pan-cancer classification using image-based deep learning
    Ye, Taoyu
    Li, Sen
    Zhang, Yang
    [J]. Zhang, Yang (zhangyang07@hit.edu.cn), 1600, Elsevier B.V. (19): : 835 - 846
  • [3] Pan-cancer integrative histology-genomic analysis via multimodal deep learning
    Chen, Richard J.
    Lu, Ming Y.
    Williamson, Drew F. K.
    Chen, Tiffany Y.
    Lipkova, Jana
    Noor, Zahra
    Shaban, Muhammad
    Shady, Maha
    Williams, Mane
    Joo, Bumjin
    Mahmood, Faisal
    [J]. CANCER CELL, 2022, 40 (08) : 865 - +
  • [4] Pan-cancer classification by regularized multi-task learning
    Hossain, Sk Md Mosaddek
    Khatun, Lutfunnesa
    Ray, Sumanta
    Mukhopadhyay, Anirban
    [J]. SCIENTIFIC REPORTS, 2021, 11 (01)
  • [5] Pan-cancer classification by regularized multi-task learning
    Sk Md Mosaddek Hossain
    Lutfunnesa Khatun
    Sumanta Ray
    Anirban Mukhopadhyay
    [J]. Scientific Reports, 11
  • [6] Identification of pan-cancer Ras pathway activation with deep learning
    Li, Xiangtao
    Li, Shaochuan
    Wang, Yunhe
    Zhang, Shixiong
    Wong, Ka-Chun
    [J]. BRIEFINGS IN BIOINFORMATICS, 2021, 22 (04)
  • [7] Extendable and explainable deep learning for pan-cancer radiogenomics research
    Liu, Qian
    Hu, Pingzhao
    [J]. CURRENT OPINION IN CHEMICAL BIOLOGY, 2022, 66
  • [8] Deep learning identifies conserved pan-cancer tumor features
    Noorbakhsh, Javad
    Farahmand, Saman
    Pour, Ali Foroughi
    Namburi, Sandeep
    Caruana, Dennis
    Rimm, David
    Soltanieh-Ha, Mohammad
    Zarringhalam, Kourosh
    Chuang, Jeffrey H.
    [J]. CLINICAL CANCER RESEARCH, 2021, 27 (05)
  • [9] Deep learning integrates histopathology and proteogenomics at a pan-cancer level
    Wang, Joshua M.
    Hong, Runyu
    Demicco, Elizabeth G.
    Tan, Jimin
    Lazcano, Rossana
    Moreira, Andre L.
    Li, Yize
    Calinawan, Anna
    Razavian, Narges
    Schraink, Tobias
    Gillette, Michael A.
    Omenn, Gilbert S.
    An, Eunkyung
    Rodriguez, Henry
    Tsirigos, Aristotelis
    Ruggles, Kelly, V
    Ding, Li
    Robles, Ana I.
    Mani, D. R.
    Rodland, Karin D.
    Lazar, Alexander J.
    Liu, Wenke
    Fenyo, David
    [J]. CELL REPORTS MEDICINE, 2023, 4 (09)
  • [10] PAN-CANCER PROGNOSIS PREDICTION USING MULTIMODAL DEEP LEARNING
    Silva, Luis A. Vale
    Rohr, Karl
    [J]. 2020 IEEE 17TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2020), 2020, : 568 - 571