Learning influential genes on cancer gene expression data with stacked denoising autoencoders

被引:0
|
作者
Teixeira, Vitor [1 ]
Camacho, Rui [2 ]
Ferreira, Pedro G. [3 ]
机构
[1] Univ Porto, Fac Engn, MIEIC, Porto, Portugal
[2] Univ Porto, Fac Engn, DEI, Porto, Portugal
[3] Univ Porto, Inst Invest & Inovacao Saude I3S, Ipatimup Inst Mol Pathol & Immunol, Porto, Portugal
关键词
Deep Learning; Gene Expression Analysis; Knowledge Extraction; RNA-Seq; Cancer; NETWORK; BREAST;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Cancer genome projects are characterizing the genome, epigenome and transcriptome of a large number of samples using the latest high-throughput sequencing assays. The generated data sets pose several challenges for traditional statistical and machine learning methods. In this work we are interested in the task of deriving the most informative genes from a cancer gene expression data set. For that goal we built denoising autoencoders (DAE) and stacked denoising autoencoders and we studied the influence of the input nodes on the final representation of the DAE. We have also compared these deep learning approaches with other existing approaches. Our study is divided into two main tasks. First, we built and compared the performance of several feature extraction methods as well as data sampling methods using classifiers that were able to distinguish the samples of thyroid cancer patients from samples of healthy persons. In the second task, we have investigated the possibility of building comprehensible descriptions of gene expression data by using Denoising Autoencoders and Stacked Denoising Autoencoders as feature extraction methods. After extracting information related to the description built by the network, namely the connection weights, we devised post-processing techniques to extract comprehensible and biologically meaningful descriptions out of the constructed models. We have been able to build high accuracy models to discriminate thyroid cancer from healthy patients but the extraction of comprehensible models is still very limited.
引用
收藏
页码:1201 / 1205
页数:5
相关论文
共 50 条
  • [1] Online Marginalized Linear Stacked Denoising Autoencoders for Learning from Big Data Stream
    Budiman, Arif
    Fanany, Mohamad Ivan
    Basaruddin, Chan
    [J]. 2015 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS), 2015, : 227 - 235
  • [2] Improving Transfer Learning Accuracy by Reusing Stacked Denoising Autoencoders
    Kandaswamy, Chetak
    Silva, Luis M.
    Alexandre, Luis A.
    Sousa, Ricardo
    Santos, Jorge M.
    de Sa, Joaquim Marques
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2014, : 1380 - 1387
  • [3] Stacked denoising autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion
    Vincent, Pascal
    Larochelle, Hugo
    Lajoie, Isabelle
    Bengio, Yoshua
    Manzagol, Pierre-Antoine
    [J]. Journal of Machine Learning Research, 2010, 11 : 3371 - 3408
  • [4] Stacked Denoising Autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion
    Vincent, Pascal
    Larochelle, Hugo
    Lajoie, Isabelle
    Bengio, Yoshua
    Manzagol, Pierre-Antoine
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2010, 11 : 3371 - 3408
  • [5] A Self-adaptive Learning Rate Principle for Stacked Denoising Autoencoders
    HAO Qian-qian
    DING Jin-kou
    WANG Jian-fei
    [J]. 软件, 2015, (09) : 82 - 86
  • [6] Gender Identification Using Marginalised Stacked Denoising Autoencoders on Twitter Data
    Al-onazi, Badriyya B.
    Nour, Mohamed K.
    Alshamrani, Hassan
    Al Duhayyim, Mesfer
    Mohsen, Heba
    Abdelmageed, Amgad Atta
    Mohammed, Gouse Pasha
    Zamani, Abu Sarwar
    [J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 36 (03): : 2529 - 2544
  • [7] Clustering Mixed Data Based on Density Peaks and Stacked Denoising Autoencoders
    Duan, Baobin
    Han, Lixin
    Gou, Zhinan
    Yang, Yi
    Chen, Shuangshuang
    [J]. SYMMETRY-BASEL, 2019, 11 (02):
  • [8] A Deep Learning Approach Based on Stacked Denoising Autoencoders for Protein Function Prediction
    Miranda, Lester James V.
    Hu, Jinglu
    [J]. 2018 IEEE 42ND ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), VOL 1, 2018, : 480 - 485
  • [9] Stacked Denoising Autoencoders for Mortality Risk Prediction Using Imbalanced Clinical Data
    Alhassan, Zakhriya
    Budgen, David
    Alshammari, Riyad
    Daghstani, Tahani
    McGough, A. Stephen
    Al Moubayed, Noura
    [J]. 2018 17TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2018, : 541 - 546
  • [10] Data Cleaning Based on Stacked Denoising Autoencoders and Multi-Sensor Collaborations
    Chang, Xiangmao
    Qiu, Yuan
    Su, Shangting
    Yang, Deliang
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 63 (02): : 691 - 703