A semi-supervised deep learning method based on stacked sparse auto-encoder for cancer prediction using RNA-seq data

被引:70
|
作者
Xiao, Yawen [1 ,2 ]
Wu, Jun [3 ,4 ]
Lin, Zongli [5 ]
Zhao, Xiaodong [6 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China
[2] Minist Educ, Key Lab Syst Control & Informat Proc, Shanghai 200240, Peoples R China
[3] East China Normal Univ, Ctr Bioinformat & Computat Biol, Shanghai Key Lab Regulatory Biol, Inst Biomed Sci, Shanghai 200241, Peoples R China
[4] East China Normal Univ, Sch Life Sci, Shanghai 200241, Peoples R China
[5] Univ Virginia, Charles L Brown Dept Elect & Comp Engn, POB 400743, Charlottesville, VA 22904 USA
[6] Shanghai Jiao Tong Univ, Sch Biomed Engn, Shanghai 200240, Peoples R China
关键词
Stacked sparse auto-encoder; Cancer prediction; Gene expression data; Semi-supervised learning; Deep learning; FEATURE-SELECTION; MACHINE; AUTOENCODER; DIAGNOSIS; PROGNOSIS;
D O I
10.1016/j.cmpb.2018.10.004
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background and objective: Cancer has become a complex health problem due to its high mortality. Over the past few decades, with the rapid development of the high-throughput sequencing technology and the application of various machine learning methods, remarkable progress in cancer research has been made based on gene expression data. At the same time, a growing amount of high-dimensional data has been generated, such as RNA-seq data, which calls for superior machine learning methods able to deal with mass data effectively in order to make accurate treatment decision. Methods: In this paper, we present a semi-supervised deep learning strategy, the stacked sparse auto-encoder (SSAE) based classification, for cancer prediction using RNA-seq data. The proposed SSAE based method employs the greedy layer-wise pre-training and a sparsity penalty term to help capture and extract important information from the high-dimensional data and then classify the samples. Results: We tested the proposed SSAE model on three public RNA-seq data sets of three types of cancers and compared the prediction performance with several commonly-used classification methods. The results indicate that our approach outperforms the other methods for all the three cancer data sets in various metrics. Conclusions: The proposed SSAE based semi-supervised deep learning model shows its promising ability to process high-dimensional gene expression data and is proved to be effective and accurate for cancer prediction. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:99 / 105
页数:7
相关论文
共 50 条
  • [31] Deep learning for fault-relevant feature extraction and fault classification with stacked supervised auto-encoder
    Wang, Yalin
    Yang, Haibing
    Yuan, Xiaofeng
    Shardt, Yuri A. W.
    Yang, Chunhua
    Gui, Weihua
    JOURNAL OF PROCESS CONTROL, 2020, 92 : 79 - 89
  • [32] Semi-supervised Auto-encoder Based Event Detection in Constructing Knowledge Graph for Social Good
    Zhao, Yue
    Jin, Xiaolong
    Wang, Yuanzhuo
    Cheng, Xueqi
    2019 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2019), 2019, : 478 - 485
  • [33] Semi-supervised active learning using convolutional auto- encoder and contrastive learning
    Roda, Hezi
    Geva, Amir B.
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2024, 7
  • [34] A deep learning method for lincRNA detection using auto-encoder algorithm
    Yu, Ning
    Yu, Zeng
    Pan, Yi
    BMC BIOINFORMATICS, 2017, 18 : 511
  • [35] scSemiAAE: a semi-supervised clustering model for single-cell RNA-seq data
    Zile Wang
    Haiyun Wang
    Jianping Zhao
    Chunhou Zheng
    BMC Bioinformatics, 24
  • [36] A deep learning method for lincRNA detection using auto-encoder algorithm
    Ning Yu
    Zeng Yu
    Yi Pan
    BMC Bioinformatics, 18
  • [37] scSemiAAE: a semi-supervised clustering model for single-cell RNA-seq data
    Wang, Zile
    Wang, Haiyun
    Zhao, Jianping
    Zheng, Chunhou
    BMC BIOINFORMATICS, 2023, 24 (01)
  • [38] A Deep Learning Method for lincRNA Identification Using Auto-encoder Algorithm
    Yu, Ning
    Yu, Zeng
    Pan, Yi
    2016 IEEE 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL ADVANCES IN BIO AND MEDICAL SCIENCES (ICCABS), 2016,
  • [39] Joint Sparse Auto-encoder: A Semi-supervised Spatio-temporal Approach in Mapping Large-scale Croplands
    Jia, Xiaowei
    Hu, Yifan
    Khandelwal, Ankush
    Karpatne, Anuj
    Kumar, Vipin
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 1173 - 1182
  • [40] Intrusion detection using deep sparse auto-encoder and self-taught learning
    Aqsa Saeed Qureshi
    Asifullah Khan
    Nauman Shamim
    Muhammad Hanif Durad
    Neural Computing and Applications, 2020, 32 : 3135 - 3147