A functional gene module identification algorithm in gene expression data based on genetic algorithm and gene ontology

被引:0
|
作者
Yan Zhang
Weiyu Shi
Yeqing Sun
机构
[1] Dalian Maritime University,College of Environmental Science and Engineering
[2] Dalian Maritime University,College of Maritime Economics & Management
来源
BMC Genomics | / 24卷
关键词
Functional gene module; Overlapping gene module; Genetic algorithm; Gene ontology; Partitioning around medoids; Gene expression data;
D O I
暂无
中图分类号
学科分类号
摘要
Since genes do not function individually, the gene module is considered an important tool for interpreting gene expression profiles. In order to consider both functional similarity and expression similarity in module identification, GMIGAGO, a functional Gene Module Identification algorithm based on Genetic Algorithm and Gene Ontology, was proposed in this work. GMIGAGO is an overlapping gene module identification algorithm, which mainly includes two stages: In the first stage (initial identification of gene modules), Improved Partitioning Around Medoids Based on Genetic Algorithm (PAM-GA) is used for the initial clustering on gene expression profiling, and traditional gene co-expression modules can be obtained. Only similarity of expression levels is considered at this stage. In the second stage (optimization of functional similarity within gene modules), Genetic Algorithm for Functional Similarity Optimization (FSO-GA) is used to optimize gene modules based on gene ontology, and functional similarity within gene modules can be improved. Without loss of generality, we compared GMIGAGO with state-of-the-art gene module identification methods on six gene expression datasets, and GMIGAGO identified the gene modules with the highest functional similarity (much higher than state-of-the-art algorithms). GMIGAGO was applied in BRCA, THCA, HNSC, COVID-19, Stem, and Radiation datasets, and it identified some interesting modules which performed important biological functions. The hub genes in these modules could be used as potential targets for diseases or radiation protection. In summary, GMIGAGO has excellent performance in mining molecular mechanisms, and it can also identify potential biomarkers for individual precision therapy.
引用
收藏
相关论文
共 50 条
  • [31] A Hybrid Ensemble Algorithm Combining AdaBoost and Genetic Algorithm for Cancer Classification with Gene Expression Data
    Lu, Huijuan
    Gao, Huiyun
    Ye, Minchao
    Wang, Xiuhui
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (03) : 863 - 870
  • [32] A Hybrid Ensemble Algorithm Combining AdaBoost and Genetic Algorithm for Cancer Classification with Gene Expression Data
    Lu, Huijuan
    Gao, Huiyun
    Ye, Minchao
    Yan, Ke
    Wang, Xiuhui
    2018 NINTH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY IN MEDICINE AND EDUCATION (ITME 2018), 2018, : 15 - 19
  • [33] Mixed recommendation algorithm based on commodity gene and genetic algorithm
    Hao, Z. (andyhao@163.com), 1600, Springer Verlag (219 LNEE):
  • [34] An ensemble correlation-based gene selection algorithm for cancer classification with gene expression data
    Piao, Yongjun
    Piao, Minghao
    Park, Kiejung
    Ryu, Keun Ho
    BIOINFORMATICS, 2012, 28 (24) : 3306 - 3315
  • [35] Gene class expression: analysis tool of Gene Ontology terms with gene expression data
    Pereira, Gislaine S. P.
    Brandao, Rodrigo M.
    Giuliatti, Silvana
    Zago, Marco A.
    Silva, Wilson A., Jr.
    GENETICS AND MOLECULAR RESEARCH, 2006, 5 (01) : 108 - 114
  • [36] Gene Expression Analyses Using Genetic Algorithm based Hybrid Approaches
    Chen, Dingjun
    Chan, Keith C. C.
    Wu, Xindong
    2008 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-8, 2008, : 963 - +
  • [37] A Homologous Gene Replacement based Genetic Algorithm
    Iqbal, Sumaiya
    Hoque, Md Tamjidul
    PROCEEDINGS OF THE 2016 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'16 COMPANION), 2016, : 91 - 92
  • [38] Protein functional module identification method combining topological features and gene expression data
    Zihao Zhao
    Wenjun Xu
    Aiwen Chen
    Yueyue Han
    Shengrong Xia
    ChuLei Xiang
    Chao Wang
    Jun Jiao
    Hui Wang
    Xiaohui Yuan
    Lichuan Gu
    BMC Genomics, 22
  • [39] Protein functional module identification method combining topological features and gene expression data
    Zhao, Zihao
    Xu, Wenjun
    Chen, Aiwen
    Han, Yueyue
    Xia, Shengrong
    Xiang, ChuLei
    Wang, Chao
    Jiao, Jun
    Wang, Hui
    Yuan, Xiaohui
    Gu, Lichuan
    BMC GENOMICS, 2021, 22 (01)
  • [40] Gene Selection for Microarray Data by a LDA-Based Genetic Algorithm
    Huerta, Edmundo Bonilla
    Duval, Beatrice
    Hao, Jin-Kao
    PATTERN RECOGNITION IN BIOINFORMATICS, PROCEEDINGS, 2008, 5265 : 250 - 261