Prediction and analysis of essential genes using the enrichments of gene ontology and KEGG pathways

被引:310
|
作者
Chen, Lei [1 ,2 ]
Zhang, Yu-Hang [3 ]
Wang, ShaoPeng [1 ]
Zhang, YunHua [4 ]
Huang, Tao [3 ]
Cai, Yu-Dong [1 ]
机构
[1] Shanghai Univ, Sch Life Sci, Shanghai, Peoples R China
[2] Shanghai Maritime Univ, Coll Informat Engn, Shanghai, Peoples R China
[3] Chinese Acad Sci, Shanghai Inst Biol Sci, Inst Hlth Sci, Shanghai, Peoples R China
[4] Anhui Agr Univ, Sch Resources & Environm, Anhui Prov Key Lab Farmland Ecol Conversat & Poll, Hefei, Anhui, Peoples R China
来源
PLOS ONE | 2017年 / 12卷 / 09期
基金
中国国家自然科学基金; 上海市自然科学基金;
关键词
CHRONIC LYMPHOCYTIC-LEUKEMIA; MESSENGER-RNA EXPRESSION; ACUTE LYMPHOBLASTIC-LEUKEMIA; ACUTE MYELOID-LEUKEMIA; AMINO-ACID TRANSPORTER; BACILLUS-SUBTILIS; FEATURE-SELECTION; ESCHERICHIA-COLI; BINDING PROTEIN; RIBOSOMAL-RNAS;
D O I
10.1371/journal.pone.0184129
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Identifying essential genes in a given organism is important for research on their fundamental roles in organism survival. Furthermore, if possible, uncovering the links between core functions or pathways with these essential genes will further help us obtain deep insight into the key roles of these genes. In this study, we investigated the essential and non-essential genes reported in a previous study and extracted gene ontology (GO) terms and biological pathways that are important for the determination of essential genes. Through the enrichment theory of GO and KEGG pathways, we encoded each essential/non-essential gene into a vector in which each component represented the relationship between the gene and one GO term or KEGG pathway. To analyze these relationships, the maximum relevance minimum redundancy (mRMR) was adopted. Then, the incremental feature selection (IFS) and support vector machine (SVM) were employed to extract important GO terms and KEGG pathways. A prediction model was built simultaneously using the extracted GO terms and KEGG pathways, which yielded nearly perfect performance, with a Matthews correlation coefficient of 0.951, for distinguishing essential and non-essential genes. To fully investigate the key factors influencing the fundamental roles of essential genes, the 21 most important GO terms and three KEGG pathways were analyzed in detail. In addition, several genes was provided in this study, which were predicted to be essential genes by our prediction model. We suggest that this study provides more functional and pathway information on the essential genes and provides a new way to investigate related problems.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Prediction and Analysis of Hepatocellular Carcinoma Related Genes Using Gene Ontology and KEGG
    Jiang, Min
    Li, Bi-Qing
    Huang, Tao
    Xu, Yao Chen
    Gu, Lei
    Kong, Xiang Yin
    CURRENT BIOINFORMATICS, 2015, 10 (01) : 31 - 38
  • [2] The use of Gene Ontology terms and KEGG pathways for analysis and prediction of oncogenes
    Xing, Zhihao
    Chu, Chen
    Chen, Lei
    Kong, Xiangyin
    BIOCHIMICA ET BIOPHYSICA ACTA-GENERAL SUBJECTS, 2016, 1860 (11): : 2725 - 2734
  • [3] Prediction and Analysis of Retinoblastoma Related Genes through Gene Ontology and KEGG
    Li, Zhen
    Li, Bi-Qing
    Jiang, Min
    Chen, Lei
    Zhang, Jian
    Liu, Lin
    Huang, Tao
    BIOMED RESEARCH INTERNATIONAL, 2013, 2013
  • [4] Analysis of cancer-related IncRNAs using gene ontology and KEGG pathways
    Chen, Lei
    Zhang, Yu-Hang
    Lu, Guohui
    Huang, Tao
    Cai, Yu-Dong
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2017, 76 : 27 - 36
  • [5] Analysis of key pathways and genes in nodal structure on rat skin surface using gene ontology and KEGG pathway
    Shin, Joonyoung
    Park, A. Yeong
    Ju, Suk
    Lee, Hyorin
    Kang, Hyung Won
    Han, Dongwoon
    Kim, Sungchul
    GENES & GENOMICS, 2025, 47 (01) : 71 - 85
  • [6] Feature Classification and Analysis of Lung Cancer Related Genes Through Gene Ontology and KEGG Pathways
    Zhou, You
    Li, Biqing
    Zhang, Yuchao
    Chen, Lei
    Kong, Xiangyin
    CURRENT BIOINFORMATICS, 2016, 11 (01) : 40 - 50
  • [7] Analysis of the chemical toxicity effects using the enrichment of Gene Ontology terms and KEGG pathways
    Chen, Lei
    Zhang, Yu-Hang
    Zou, Quan
    Chu, Chen
    Ji, Zhiliang
    BIOCHIMICA ET BIOPHYSICA ACTA-GENERAL SUBJECTS, 2016, 1860 (11): : 2619 - 2626
  • [8] Analysis of Tumor Suppressor Genes Based on Gene Ontology and the KEGG Pathway
    Yang, Jing
    Chen, Lei
    Kong, Xiangyin
    Huang, Tao
    Cai, Yu-Dong
    PLOS ONE, 2014, 9 (09):
  • [9] Prediction of Essential Genes by Mining Gene Ontology Semantics
    Liu, Yu-Cheng
    Chiu, Po-, I
    Huang, Hsuan-Cheng
    Tseng, Vincent S.
    BIOINFORMATICS RESEARCH AND APPLICATIONS, 2011, 6674 : 49 - +
  • [10] Identification of biologically significant genes using Gene Ontology (GO) and pathways analysis
    Golovan, Serguei
    Husain, Mainul
    JOURNAL OF IMMUNOLOGY, 2010, 184