Prediction and analysis of essential genes using the enrichments of gene ontology and KEGG pathways

被引:310
|
作者
Chen, Lei [1 ,2 ]
Zhang, Yu-Hang [3 ]
Wang, ShaoPeng [1 ]
Zhang, YunHua [4 ]
Huang, Tao [3 ]
Cai, Yu-Dong [1 ]
机构
[1] Shanghai Univ, Sch Life Sci, Shanghai, Peoples R China
[2] Shanghai Maritime Univ, Coll Informat Engn, Shanghai, Peoples R China
[3] Chinese Acad Sci, Shanghai Inst Biol Sci, Inst Hlth Sci, Shanghai, Peoples R China
[4] Anhui Agr Univ, Sch Resources & Environm, Anhui Prov Key Lab Farmland Ecol Conversat & Poll, Hefei, Anhui, Peoples R China
来源
PLOS ONE | 2017年 / 12卷 / 09期
基金
中国国家自然科学基金; 上海市自然科学基金;
关键词
CHRONIC LYMPHOCYTIC-LEUKEMIA; MESSENGER-RNA EXPRESSION; ACUTE LYMPHOBLASTIC-LEUKEMIA; ACUTE MYELOID-LEUKEMIA; AMINO-ACID TRANSPORTER; BACILLUS-SUBTILIS; FEATURE-SELECTION; ESCHERICHIA-COLI; BINDING PROTEIN; RIBOSOMAL-RNAS;
D O I
10.1371/journal.pone.0184129
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Identifying essential genes in a given organism is important for research on their fundamental roles in organism survival. Furthermore, if possible, uncovering the links between core functions or pathways with these essential genes will further help us obtain deep insight into the key roles of these genes. In this study, we investigated the essential and non-essential genes reported in a previous study and extracted gene ontology (GO) terms and biological pathways that are important for the determination of essential genes. Through the enrichment theory of GO and KEGG pathways, we encoded each essential/non-essential gene into a vector in which each component represented the relationship between the gene and one GO term or KEGG pathway. To analyze these relationships, the maximum relevance minimum redundancy (mRMR) was adopted. Then, the incremental feature selection (IFS) and support vector machine (SVM) were employed to extract important GO terms and KEGG pathways. A prediction model was built simultaneously using the extracted GO terms and KEGG pathways, which yielded nearly perfect performance, with a Matthews correlation coefficient of 0.951, for distinguishing essential and non-essential genes. To fully investigate the key factors influencing the fundamental roles of essential genes, the 21 most important GO terms and three KEGG pathways were analyzed in detail. In addition, several genes was provided in this study, which were predicted to be essential genes by our prediction model. We suggest that this study provides more functional and pathway information on the essential genes and provides a new way to investigate related problems.
引用
收藏
页数:22
相关论文
共 50 条
  • [41] An Approach for the Identification of Targets Specific to Bone Metastasis Using Cancer Genes Interactome and Gene Ontology Analysis
    Vashisht, Shikha
    Bagler, Ganesh
    PLOS ONE, 2012, 7 (11):
  • [42] Bioinformatics Analysis of Key Genes and Pathways Associated with Thrombosis in Essential Thrombocythemia
    Guo, Chao
    Li, Zhenling
    MEDICAL SCIENCE MONITOR, 2019, 25 : 9262 - 9271
  • [43] Gene function prediction using protein domain probability and hierarchical Gene Ontology information
    Jung, Jaehee
    Thon, Michael R.
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 2161 - 2164
  • [44] Improving protein function prediction using the hierarchical structure of the gene ontology
    Eisner, R
    Poulin, B
    Szafron, D
    Lu, P
    Greiner, R
    PROCEEDINGS OF THE 2005 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2005, : 354 - 363
  • [45] Improving subcellular localization prediction using text classification and the gene ontology
    Fyshe, Alona
    Liu, Yifeng
    Szafron, Duane
    Greiner, Russ
    Lu, Paul
    BIOINFORMATICS, 2008, 24 (21) : 2512 - 2517
  • [46] Analysis of Lymphoma-Related Genes with Gene Ontology and Kyoto Encyclopedia of Genes and Genomes Enrichment
    Sun, Qiao
    Bai, Lin
    Zhu, Shaopin
    Cheng, Lu
    Xu, Yang
    Cai, Yu-Dong
    Chen, Hui
    Zhang, Jian
    BIOMED RESEARCH INTERNATIONAL, 2022, 2022
  • [47] Analysis of Lymphoma-Related Genes with Gene Ontology and Kyoto Encyclopedia of Genes and Genomes Enrichment
    Sun, Qiao
    Bai, Lin
    Zhu, Shaopin
    Cheng, Lu
    Xu, Yang
    Cai, Yu-Dong
    Chen, Hui
    Zhang, Jian
    BIOMED RESEARCH INTERNATIONAL, 2022, 2022
  • [48] Analysis of Important Gene Ontology Terms and Biological Pathways Related to Pancreatic Cancer
    Yin, Hang
    Wang, ShaoPeng
    Zhang, Yu-Hang
    Cai, Yu-Dong
    Liu, Hailin
    BIOMED RESEARCH INTERNATIONAL, 2016, 2016
  • [49] Analysis of Lymphoma-Related Genes with Gene Ontology and Kyoto Encyclopedia of Genes and Genomes Enrichment
    Sun, Qiao
    Bai, Lin
    Zhu, Shaopin
    Cheng, Lu
    Xu, Yang
    Cai, Yu-Dong
    Chen, Hui
    Zhang, Jian
    BIOMED RESEARCH INTERNATIONAL, 2022, 2022
  • [50] KEGG-PATH: Kyoto encyclopedia of genes and genomes-based pathway analysis using a path analysis model
    Du, Junli
    Yuan, Zhifa
    Ma, Ziwei
    Song, Jiuzhou
    Xie, Xiaoli
    Chen, Yulin
    MOLECULAR BIOSYSTEMS, 2014, 10 (09) : 2441 - 2447