A multi-objective optimization approach for the identification of cancer biomarkers from RNA-seq data

Cited by: 10
Authors
Coleto-Alcudia, Veredas [1]
Vega-Rodriguez, Miguel A. [1]
Affiliations
[1] Univ Extremadura, Dept Comp & Commun Technol, Campus Univ S-N, Caceres 10003, Spain
Keywords
Multi-objective optimization; Evolutionary computation; Support vector machine; Cancer; Biomarker; RNA-seq; FEATURE-SELECTION; GENE-EXPRESSION; MULTICLASS;
DOI
10.1016/j.eswa.2021.116480
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Identification of biomarkers is essential for the diagnosis and prognosis of certain diseases, such as cancer. The purpose of gene selection is to find the minimum number of genes that can classify a sample (e.g. normal or tumour) with high accuracy. The selected genes can then be studied as potential cancer biomarkers. In this article, a new two-step method for gene selection is proposed. The first step filters the most relevant genes of a gene expression dataset by combining three feature selection methods. Since gene selection is a two-objective problem (minimizing the number of selected genes while maximizing the classification accuracy), the second step is performed as a multi-objective optimization, using an Artificial Bee Colony based on Dominance (ABCD) algorithm. The ABCD algorithm internally uses a support vector machine (SVM) classifier. The method has been tested with five RNA-seq cancer datasets, and its results have been compared with those of five other methods proposed in the scientific literature by other authors. Finally, to check whether the genes selected by the proposed method could be studied as biomarkers, the relation between the selected genes and the cancer they belong to is analysed. It can be concluded that the proposed method is effective in gene selection for the identification of cancer biomarkers from RNA-seq data.
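The two-objective formulation in the abstract (fewer genes, higher accuracy) rests on the notion of Pareto dominance. The following is a minimal sketch of that notion only, not the ABCD algorithm itself; the names `dominates` and `pareto_front` and the candidate values are hypothetical and chosen purely for illustration.

```python
from typing import List, Tuple

# A candidate is (number_of_selected_genes, classification_accuracy).
# Illustrative representation; not taken from the article.
Solution = Tuple[int, float]


def dominates(a: Solution, b: Solution) -> bool:
    """Return True if a is no worse than b on both objectives
    (fewer or equal genes, higher or equal accuracy) and strictly
    better on at least one of them."""
    no_worse = a[0] <= b[0] and a[1] >= b[1]
    strictly_better = a[0] < b[0] or a[1] > b[1]
    return no_worse and strictly_better


def pareto_front(solutions: List[Solution]) -> List[Solution]:
    """Keep only the solutions that no other solution dominates."""
    return [s for s in solutions
            if not any(dominates(other, s) for other in solutions)]


# Made-up candidates: (gene count, accuracy).
candidates = [(5, 0.95), (3, 0.95), (3, 0.90), (10, 0.99), (12, 0.99)]
front = pareto_front(candidates)
print(front)
```

In this toy set, a 3-gene subset with 0.95 accuracy and a 10-gene subset with 0.99 accuracy are both kept, because neither is better on both objectives at once; a multi-objective method returns such trade-offs rather than a single answer.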
Pages: 10
Related papers
50 in total
  • [41] Utilizing RNA-Seq Data for Cancer Network Inference
    Cai, Ying
    Fendler, Bernard
    Atwal, Gurinder S.
    2012 IEEE INTERNATIONAL WORKSHOP ON GENOMIC SIGNAL PROCESSING AND STATISTICS (GENSIPS), 2012, : 46 - 49
  • [42] Automated identification of reference genes based on RNA-seq data
    Carmona, Rosario
    Arroyo, Macarena
    Jose Jimenez-Quesada, Maria
    Seoane, Pedro
    Zafra, Adoracion
    Larrosa, Rafael
    de Dios Alche, Juan
    Gonzalo Claros, M.
    BIOMEDICAL ENGINEERING ONLINE, 2017, 16
  • [44] An automated approach for global identification of sRNA-encoding regions in RNA-Seq data from Mycobacterium tuberculosis
    Wang, Ming
    Fleming, Joy
    Li, Zihui
    Li, Chuanyou
    Zhang, Hongtai
    Xue, Yunxin
    Chen, Maoshan
    Zhang, Zongde
    Zhang, Xian-En
    Bi, Lijun
    ACTA BIOCHIMICA ET BIOPHYSICA SINICA, 2016, 48 (06) : 544 - 553
  • [45] Identification of Pathogen Signatures in Prostate Cancer Using RNA-seq
    Chen, Yunqin
    Wei, Jia
    PLOS ONE, 2015, 10 (06):
  • [46] Identification of LIPH as an unfavorable biomarkers correlated with immune suppression or evasion in pancreatic cancer based on RNA-seq
    Zhuang, Hongkai
    Chen, Xinming
    Wang, Ying
    Huang, Shanzhou
    Chen, Bo
    Zhang, Chuanzhao
    Hou, Baohua
    CANCER IMMUNOLOGY IMMUNOTHERAPY, 2022, 71 (03) : 601 - 612
  • [48] Identification of somatic mutations in human prostate cancer by RNA-Seq
    Xu, XiaoLin
    Zhu, KaiChang
    Liu, Feng
    Wang, Yue
    Shen, JianGuo
    Jin, Jizhong
    Wang, Zhong
    Chen, Lin
    Li, Jiadong
    Xu, Min
    GENE, 2013, 519 (02) : 343 - 347
  • [49] Identification of novel transcripts deregulated in buccal cancer by RNA-seq
    Sajnani, Manisha R.
    Patel, Amrutlal K.
    Bhatt, Vaibhav D.
    Tripathi, Ajai K.
    Ahir, Viral B.
    Shankar, Vangipuram
    Shah, Siddharth
    Shah, Tejas M.
    Koringa, Prakash G.
    Jakhesara, Subhash J.
    Joshi, Chaitanya G.
    GENE, 2012, 507 (02) : 152 - 158
  • [50] Analyzing RNA-Seq Gene Expression Data for Cancer Classification Through ML Approach
    Wahid, Abdul
    Banday, M. Tariq
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (09) : 798 - 810