A multi-objective optimization approach for the identification of cancer biomarkers from RNA-seq data

被引:10
|
作者
Coleto-Alcudia, Veredas [1 ]
Vega-Rodriguez, Miguel A. [1 ]
机构
[1] Univ Extremadura, Dept Comp & Commun Technol, Campus Univ S-N, Caceres 10003, Spain
关键词
Multi-objective optimization; Evolutionary computation; Support vector machine; Cancer; Biomarker; RNA-seq; FEATURE-SELECTION; GENE-EXPRESSION; MULTICLASS;
D O I
10.1016/j.eswa.2021.116480
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Identification of biomarkers is essential for the diagnosis and prognosis of certain diseases, like cancer. Gene selection purpose is finding the minimum number of genes that can classify a (e.g. normal or tumour) sample with a high accuracy. Therefore, the selected genes can be studied as potential cancer biomarkers. In this article, a new method for gene selection is proposed in two steps. The first step is a filtering of the most relevant genes of a gene expression dataset. In this step, three feature selection methods have been combined. Since gene selection is a two-objective problem (minimizing the number of selected genes while maximizing the classification accuracy), the second step is performed as a multi-objective optimization, using an Artificial Bee Colony based on Dominance (ABCD) algorithm. ABCD algorithm uses internally a support vector machine (SVM) classifier. The method has been tested with five RNA-seq cancer datasets and with a comparative study of the results obtained by the method and by other five methods proposed in the scientific literature by other authors. Finally, in order to check if the genes selected by the proposed method could be studied as biomarkers, the relation between the selected genes and the cancer they belong to is analysed. It can be concluded that the proposed method is effective in gene selection for the identification of cancer biomarkers from RNA-seq data.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Identification of the Pathogenic Biomarkers for Hepatocellular Carcinoma Based on RNA-seq Analyses
    Jiang, Wentao
    Zhang, Li
    Guo, Qingjun
    Wang, Honghai
    Ma, Ming
    Sun, Jisan
    Chen, Chiyi
    PATHOLOGY & ONCOLOGY RESEARCH, 2019, 25 (03) : 1207 - 1213
  • [32] The Role of the Microbiome in Colorectal Cancer derived from RNA-seq Data
    Hamed, Babaee
    RESEARCH JOURNAL OF BIOTECHNOLOGY, 2022, 17 (06): : 21 - 44
  • [33] An investigation of biomarkers derived from legacy microarray data for their utility in the RNA-seq era
    Zhenqiang Su
    Hong Fang
    Huixiao Hong
    Leming Shi
    Wenqian Zhang
    Wenwei Zhang
    Yanyan Zhang
    Zirui Dong
    Lee J Lancashire
    Marina Bessarabova
    Xi Yang
    Baitang Ning
    Binsheng Gong
    Joe Meehan
    Joshua Xu
    Weigong Ge
    Roger Perkins
    Matthias Fischer
    Weida Tong
    Genome Biology, 15
  • [34] A novel model used to detect differential splice junctions as biomarkers in prostate cancer from RNA-Seq data
    Rezaeian, Iman
    Tavakoli, Ahmad
    Cavallo-Medved, Dora
    Porter, Lisa A.
    Rueda, Luis
    JOURNAL OF BIOMEDICAL INFORMATICS, 2016, 60 : 422 - 430
  • [35] An investigation of biomarkers derived from legacy microarray data for their utility in the RNA-seq era
    Su, Zhenqiang
    Fang, Hong
    Hong, Huixiao
    Shi, Leming
    Zhang, Wenqian
    Zhang, Wenwei
    Zhang, Yanyan
    Dong, Zirui
    Lancashire, Lee J.
    Bessarabova, Marina
    Yang, Xi
    Ning, Baitang
    Gong, Binsheng
    Meehan, Joe
    Xu, Joshua
    Ge, Weigong
    Perkins, Roger
    Fischer, Matthias
    Tong, Weida
    GENOME BIOLOGY, 2014, 15 (12): : 523
  • [36] Identification of hub glycogenes and their nsSNP analysis from mouse RNA-Seq data
    Firoz, Ahmad
    Malik, Adeel
    Singh, Sanjay Kumar
    Jha, Vivekanand
    Ali, Amjad
    GENE, 2015, 574 (02) : 235 - 246
  • [37] APAtrap: identification and quantification of alternative polyadenylation sites from RNA-seq data
    Ye, Congting
    Long, Yuqi
    Ji, Guoli
    Li, Qingshun Quinn
    Wu, Xiaohui
    BIOINFORMATICS, 2018, 34 (11) : 1841 - 1849
  • [38] A survey on identification and quantification of alternative polyadenylation sites from RNA-seq data
    Chen, Moliang
    Ji, Guoli
    Fu, Hongjuan
    Lin, Qianmin
    Ye, Congting
    Ye, Wenbin
    Su, Yaru
    Wu, Xiaohui
    BRIEFINGS IN BIOINFORMATICS, 2020, 21 (04) : 1261 - 1276
  • [39] A Soft Approach to Multi-objective Optimization
    Bistarelli, Stefano
    Gadducci, Fabio
    Larrosa, Javier
    Rollon, Emma
    LOGIC PROGRAMMING, PROCEEDINGS, 2008, 5366 : 764 - +
  • [40] Structural RNA alignment by multi-objective optimization
    Schnattinger, Thomas
    Schoning, Uwe
    Kestler, Hans A.
    BIOINFORMATICS, 2013, 29 (13) : 1607 - 1613