HELP: A computational framework for labelling and predicting human common and context-specific essential genes

被引:0
|
作者
Granata, Ilaria [1 ]
Maddalena, Lucia [1 ]
Manzo, Mario [2 ]
Guarracino, Mario Rosario [3 ,4 ]
Giordano, Maurizio [1 ]
机构
[1] CNR, Inst High Performance Comp & Networking, Naples, Italy
[2] Univ Naples LOrientale, Informat Technol Serv, Naples, Italy
[3] Natl Res Univ Higher Sch Econ, Lab Algorithms & Technol Network Anal, Nizhnii Novgorod, Russia
[4] Univ Cassino & Southern Lazio, Dept Econ & Law, Cassino, Frosinone, Italy
关键词
Computational framework - Essential gene - Gene annotation - Human genes - Labelings - Learning-based approach - Machine-learning - Multi-Sources - Predictive models - Source data;
D O I
10.1371/journal.pcbi.1012076
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Machine learning-based approaches are particularly suitable for identifying essential genes as they allow the generation of predictive models trained on features from multi-source data. Gene essentiality is neither binary nor static but determined by the context. The databases for essential gene annotation do not permit the personalisation of the context, and their update can be slower than the publication of new experimental data. We propose HELP (Human Gene Essentiality Labelling & Prediction), a computational framework for labelling and predicting essential genes. Its double scope allows for identifying genes based on dependency or not on experimental data. The effectiveness of the labelling method was demonstrated by comparing it with other approaches in overlapping the reference sets of essential gene annotations, where HELP demonstrated the best compromise between false and true positive rates. The gene attributes, including multi-omics and network embedding features, lead to high-performance prediction of essential genes while confirming the existence of essentiality nuances. Essential genes (EGs) are commonly defined as those required for an organism or cell's growth and survival. The essentiality is strictly dependent on both environmental and genetic conditions, determining a difference between those considered common EGs (cEGs), essential in most of the contexts considered, and those essential specifically to one or few contexts (context-specific EGs, csEGs). In this paper, we present a library of tools and methodologies to address the identification and prediction of cEGs and csEGs. Furthermore, we attempt to experimentally explore the statement that essentiality is not a binary property by identifying, predicting and analysing an intermediate class between the Essential (E) and Not Essential (NE) genes. Among the multi-source data used to predict the EGs, we found the best attributes combination to capture the essentiality. We demonstrated that the additional class of genes we defined as "almost Essential" shows differences in these attributes from the E and NE genes. We believe that investigating the context-specificity and the dynamism of essentiality is particularly relevant to unravelling crucial insights into biological mechanisms and suggesting new candidates for precision medicine.
引用
收藏
页数:23
相关论文
共 50 条
  • [41] A framework for intelligent question answering system using semantic context-specific document clustering and Wordnet
    K Karpagam
    A Saradha
    Sādhanā, 2019, 44
  • [42] A pipeline for the reconstruction and evaluation of context-specific human metabolic models at a large-scale
    Vieira, Vitor
    Ferreira, Jorge
    Rocha, Miguel
    PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (06)
  • [43] REGNET: mining context-specific human transcription networks using composite genomic information
    Sang-Mun Chi
    Young-Kyo Seo
    Young-Kyu Park
    Sora Yoon
    Chan Young Park
    Yong Sung Kim
    Seon-Young Kim
    Dougu Nam
    BMC Genomics, 15
  • [44] The human brain reactivates context-specific past information at event boundaries of naturalistic experiences
    Avital Hahamy
    Haim Dubossarsky
    Timothy E. J. Behrens
    Nature Neuroscience, 2023, 26 : 1080 - 1089
  • [45] REGNET: mining context-specific human transcription networks using composite genomic information
    Chi, Sang-Mun
    Seo, Young-Kyo
    Park, Young-Kyu
    Yoon, Sora
    Park, Chan Young
    Kim, Yong Sung
    Kim, Seon-Young
    Nam, Dougu
    BMC GENOMICS, 2014, 15
  • [46] The human brain reactivates context-specific past information at event boundaries of naturalistic experiences
    Hahamy, Avital
    Dubossarsky, Haim
    Behrens, Timothy E. J.
    NATURE NEUROSCIENCE, 2023, 26 (06) : 1080 - +
  • [47] EpiGePT: a pretrained transformer-based language model for context-specific human epigenomics
    Gao, Zijing
    Liu, Qiao
    Zeng, Wanwen
    Jiang, Rui
    Wong, Wing Hung
    GENOME BIOLOGY, 2024, 25 (01):
  • [48] A Computational Framework Based on Ensemble Deep Neural Networks for Essential Genes Identification
    Le, Nguyen Quoc Khanh
    Do, Duyen Thi
    Hung, Truong Nguyen Khanh
    Lam, Luu Ho Thanh
    Huynh, Tuan-Tu
    Nguyen, Ngan Thi Kim
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2020, 21 (23) : 1 - 16
  • [49] BAGEL: a computational framework for identifying essential genes from pooled library screens
    Traver Hart
    Jason Moffat
    BMC Bioinformatics, 17
  • [50] BAGEL: a computational framework for identifying essential genes from pooled library screens
    Hart, Traver
    Moffat, Jason
    BMC BIOINFORMATICS, 2016, 17