Occlusion enhanced pan-cancer classification via deep learning

被引:0
|
作者
Zhao, Xing [1 ,2 ]
Chen, Zigui [3 ]
Wang, Huating [4 ]
Sun, Hao [2 ,5 ]
机构
[1] Chinese Univ Hong Kong, Dept Orthopaed & Traumatol, Hong Kong, Peoples R China
[2] Chinese Univ Hong Kong, Warshel Inst Computat Biol, Shenzhen, Guangdong, Peoples R China
[3] Chinese Univ Hong Kong, Dept Microbiol, Hong Kong, Peoples R China
[4] Chinese Univ Hong Kong, Li Ka Shing Inst Hlth Sci, Dept Orthopaed & Traumatol, Hong Kong, Peoples R China
[5] Chinese Univ Hong Kong, Li Ka Shing Inst Hlth Sci, Dept Chem Pathol, Hong Kong, Peoples R China
来源
BMC BIOINFORMATICS | 2024年 / 25卷 / 01期
关键词
Pan-cancer classification; Marker gene identification; Deep neural network; Long short term memory; Occlusion; HUMAN PROTEIN ATLAS; CELL-PROLIFERATION; COLORECTAL-CANCER; EXPRESSION; PSEUDOGENE; PROGNOSIS; PROGRESSION; PROMOTES; SAMPLES; P53;
D O I
10.1186/s12859-024-05870-y
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Quantitative measurement of RNA expression levels through RNA-Seq is an ideal replacement for conventional cancer diagnosis via microscope examination. Currently, cancer-related RNA-Seq studies focus on two aspects: classifying the status and tissue of origin of a sample and discovering marker genes. Existing studies typically identify marker genes by statistically comparing healthy and cancer samples. However, this approach overlooks marker genes with low expression level differences and may be influenced by experimental results. This paper introduces "GENESO," a novel framework for pan-cancer classification and marker gene discovery using the occlusion method in conjunction with deep learning. we first trained a baseline deep LSTM neural network capable of distinguishing the origins and statuses of samples utilizing RNA-Seq data. Then, we propose a novel marker gene discovery method called "Symmetrical Occlusion (SO)". It collaborates with the baseline LSTM network, mimicking the "gain of function" and "loss of function" of genes to evaluate their importance in pan-cancer classification quantitatively. By identifying the genes of utmost importance, we then isolate them to train new neural networks, resulting in higher-performance LSTM models that utilize only a reduced set of highly relevant genes. The baseline neural network achieves an impressive validation accuracy of 96.59% in pan-cancer classification. With the help of SO, the accuracy of the second network reaches 98.30%, while using 67% fewer genes. Notably, our method excels in identifying marker genes that are not differentially expressed. Moreover, we assessed the feasibility of our method using single-cell RNA-Seq data, employing known marker genes as a validation test.
引用
收藏
页数:24
相关论文
共 50 条
  • [21] Functional and embedding feature analysis for pan-cancer classification
    Lu, Jian
    Li, JiaRui
    Ren, Jingxin
    Ding, Shijian
    Zeng, Zhenbing
    Huang, Tao
    Cai, Yu-Dong
    [J]. FRONTIERS IN ONCOLOGY, 2022, 12
  • [22] Pan-cancer classification of multi-omics data based on machine learning models
    Cava, Claudia
    Sabetian, Soudabeh
    Salvatore, Christian
    Castiglioni, Isabella
    [J]. NETWORK MODELING AND ANALYSIS IN HEALTH INFORMATICS AND BIOINFORMATICS, 2024, 13 (01):
  • [23] Deep Learning-Based Pan-Cancer Classification Model Reveals Tissue-of-Origin Specific Gene Expression Signatures
    Divate, Mayur
    Tyagi, Aayush
    Richard, Derek J.
    Prasad, Prathosh A.
    Gowda, Harsha
    Nagaraj, Shivashankar H.
    [J]. CANCERS, 2022, 14 (05)
  • [24] Pan-cancer classification of single cells in the tumour microenvironment
    Ido Nofech-Mozes
    David Soave
    Philip Awadalla
    Sagi Abelson
    [J]. Nature Communications, 14
  • [25] AVBAE-MODFR: A novel deep learning framework of embedding and feature selection on multi-omics data for pan-cancer classification
    Li, Minghe
    Guo, Huike
    Wang, Keao
    Kang, Chuanze
    Yin, Yanbin
    Zhang, Han
    [J]. Computers in Biology and Medicine, 2024, 177
  • [26] Pan-cancer methylome analysis for cancer diagnosis and classification of cancer cell of origin
    Dai Shimizu
    Kenzui Taniue
    Yusuke Matsui
    Hiroshi Haeno
    Hiromitsu Araki
    Fumihito Miura
    Mitsuko Fukunaga
    Kenji Shiraishi
    Yuji Miyamoto
    Seiichi Tsukamoto
    Aya Komine
    Yuta Kobayashi
    Akihiro Kitagawa
    Yukihiro Yoshikawa
    Kuniaki Sato
    Tomoko Saito
    Shuhei Ito
    Takaaki Masuda
    Atsushi Niida
    Makoto Suzuki
    Hideo Baba
    Takashi Ito
    Nobuyoshi Akimitsu
    Yasuhiro Kodera
    Koshi Mimori
    [J]. Cancer Gene Therapy, 2022, 29 : 428 - 436
  • [27] A pan-cancer PDX histology image repository with genomic and pathological annotations for deep learning analysis
    White, Brian S.
    Woo, Xing Yi
    Koc, Soner
    Sheridan, Todd
    Neuhauser, Steven B.
    Wang, Shidan
    Evrard, Yvonne A.
    Landua, John David
    Mashl, R. Jay
    Davies, Sherri R.
    Fang, Bingliang
    Raso, Maria Gabriela
    Evans, Kurt W.
    Bailey, Matthew H.
    Chen, Yeqing
    Xiao, Min
    Rubinstein, Jill
    Pour, Ali Foroughi
    Dobrolecki, Lacey Elizabeth
    Fujita, Maihi
    Fujimoto, Junya
    Xiao, Guanghua
    Fields, Ryan C.
    Mudd, Jacqueline L.
    Xu, Xiaowei
    Hollingshead, Melinda G.
    Jiwani, Shahanawaz
    Consortium, Pdxnet
    Wallace, Tiffany A.
    Moscow, Jeffrey A.
    Doroshow, James H.
    Mitsiades, Nicholas
    Kaochar, Salma
    Pan, Chong-Xian
    Chen, Moon S.
    Carvajal-Carmona, Luis G.
    Welm, Alana L.
    Welm, Bryan E.
    Govindan, Ramaswamy
    Li, Shunqiang
    Davies, Michael A.
    Roth, Jack A.
    Meric-Bernstam, Funda
    Xie, Yang
    Herlyn, Meenhard
    Ding, Li
    Lewis, Michael T.
    Bolt, Carol J.
    Dean, Dennis A.
    Chuang, Jeffrey H.
    [J]. CANCER RESEARCH, 2023, 83 (07)
  • [28] Pan-cancer methylome analysis for cancer diagnosis and classification of cancer cell of origin
    Shimizu, Dai
    Taniue, Kenzui
    Matsui, Yusuke
    Haeno, Hiroshi
    Araki, Hiromitsu
    Miura, Fumihito
    Fukunaga, Mitsuko
    Shiraishi, Kenji
    Miyamoto, Yuji
    Tsukamoto, Seiichi
    Komine, Aya
    Kobayashi, Yuta
    Kitagawa, Akihiro
    Yoshikawa, Yukihiro
    Sato, Kuniaki
    Saito, Tomoko
    Ito, Shuhei
    Masuda, Takaaki
    Niida, Atsushi
    Suzuki, Makoto
    Baba, Hideo
    Ito, Takashi
    Akimitsu, Nobuyoshi
    Kodera, Yasuhiro
    Mimori, Koshi
    [J]. CANCER GENE THERAPY, 2022, 29 (05) : 428 - 436
  • [29] Pan-cancer classification on gene expression data by neural network
    Yu, Kijin
    Kim, Bong-Hyun
    Lee, Peter Chang Whan
    [J]. CANCER RESEARCH, 2019, 79 (13)
  • [30] Universal encoding of pan-cancer histology by deep texture representations
    Komura, Daisuke
    Kawabe, Akihiro
    Fukuta, Keisuke
    Sano, Kyohei
    Umezaki, Toshikazu
    Koda, Hirotomo
    Suzuki, Ryohei
    Tominaga, Ken
    Ochi, Mieko
    Konishi, Hiroki
    Masakado, Fumiya
    Saito, Noriyuki
    Sato, Yasuyoshi
    Onoyama, Takumi
    Nishida, Shu
    Furuya, Genta
    Katoh, Hiroto
    Yamashita, Hiroharu
    Kakimi, Kazuhiro
    Seto, Yasuyuki
    Ushiku, Tetsuo
    Fukayama, Masashi
    Ishikawa, Shumpei
    [J]. CELL REPORTS, 2022, 38 (09):