Understanding protein dispensability through machine-learning analysis of high-throughput data

被引:71
|
作者
Chen, Y
Xu, D [1 ]
机构
[1] UT ORNL, Grad Sch Genome Sci & Technol, Oak Ridge, TN 37830 USA
[2] Univ Missouri, Dept Comp Sci, Digital Biol Lab, Columbia, MO USA
关键词
D O I
10.1093/bioinformatics/bti058
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Protein dispensability is fundamental to the understanding of gene function and evolution. Recent advances in generating high-throughput data such as genomic sequence data, protein-protein interaction data, gene-expression data and growth-rate data of mutants allow us to investigate protein dispensability systematically at the genome scale. Results: In our studies, protein dispensability is represented as a fitness score that is measured by the growth rate of gene-deletion mutants. By the analyses of high-throughput data in yeast Saccharomyces cerevisiae, we found that a protein's dispensability had significant correlations with its evolutionary rate and duplication rate, as well as its connectivity in protein-protein interaction network and gene-expression correlation network. Neural network and support vector machine were applied to predict protein dispensability through high-throughput data. Our studies shed some lights on global characteristics of protein dispensability and evolution.
引用
收藏
页码:575 / 581
页数:7
相关论文
共 50 条
  • [21] Oxygen Vacancy Formation Energy in Metal Oxides: High-Throughput Computational Studies and Machine-Learning Predictions
    Baldassarri, Bianca
    He, Jiangang
    Gopakumar, Abhijith
    Griesemer, Sean
    Salgado-Casanova, Adolfo J. A.
    Liu, Tzu-Chen
    Torrisi, Steven B.
    Wolverton, Chris
    CHEMISTRY OF MATERIALS, 2023, 35 (24) : 10619 - 10634
  • [22] Predicting drug solubility in organic solvents mixtures: A machine-learning approach supported by high-throughput experimentation
    Cenci, Francesca
    Diab, Samir
    Ferrini, Paola
    Harabajiu, Catajina
    Barolo, Massimiliano
    Bezzo, Fabrizio
    Facco, Pierantonio
    INTERNATIONAL JOURNAL OF PHARMACEUTICS, 2024, 660
  • [23] High-throughput discovery of chemical structure-polarity relationships combining automation and machine-learning techniques
    Xu, Hao
    Lin, Jinglong
    Liu, Qianyi
    Chen, Yuntian
    Zhang, Jianning
    Yang, Yang
    Young, Michael C.
    Xu, Yan
    Zhang, Dongxiao
    Mo, Fanyang
    CHEM, 2022, 8 (12): : 3202 - 3214
  • [24] Towards the Integration of Metabolic Network Modelling and Machine Learning for the Routine Analysis of High-Throughput Patient Data
    Pacheco, Maria Pires
    Bintener, Tamara
    Sauter, Thomas
    AUTOMATED REASONING FOR SYSTEMS BIOLOGY AND MEDICINE, 2019, 30 : 401 - 424
  • [25] First-principle-data-integrated machine-learning approach for high-throughput searching of ternary electrocatalyst toward oxygen reduction reaction
    Chun, Hoje
    Lee, Eunjik
    Nam, Kyungju
    Jang, Ji-Hoon
    Kyoung, Woomin
    Noh, Seung Hyo
    Han, Byungchan
    CHEM CATALYSIS, 2021, 1 (04): : 855 - 869
  • [26] Data Resource Profile: Nationwide registry data for high-throughput epidemiology and machine learning (FinRegistry)
    Viippola, Essi
    Kuitunen, Sara
    Rodosthenous, Rodosthenis S.
    Vabalas, Andrius
    Hartonen, Tuomo
    Vartiainen, Pekka
    Demmler, Joanne
    Vuorinen, Anna-Leena
    Liu, Aoxing
    Havulinna, Aki S.
    Llorens, Vincent
    Detrois, Kira E.
    Wang, Feiyi
    Ferro, Matteo
    Karvanen, Antti
    German, Jakob
    Jukarainen, Sakari
    Gracia-Tabuenca, Javier
    Hiekkalinna, Tero
    Koskelainen, Sami
    Kiiskinen, Tuomo
    Lahtela, Elisa
    Lemmela, Susanna
    Paajanen, Teemu
    Siirtola, Harri
    Reeve, Mary Pat
    Kristiansson, Kati
    Brunfeldt, Minna
    Aavikko, Mervi
    Perola, Markus
    Ganna, Andrea
    INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2023, 52 (04) : E195 - E200
  • [27] Systems Analysis of High-Throughput Data
    Braun, Rosemary
    SYSTEMS BIOLOGY APPROACH TO BLOOD, 2014, 844 : 153 - 187
  • [28] High-throughput data analysis.
    Rogers, D
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2002, 224 : U510 - U510
  • [29] Machine Learning for High-Throughput Stress Phenotyping in Plants
    Singh, Arti
    Ganapathysubramanian, Baskar
    Singh, Asheesh Kumar
    Sarkar, Soumik
    TRENDS IN PLANT SCIENCE, 2016, 21 (02) : 110 - 124
  • [30] Protein function prediction with high-throughput data
    Xing-Ming Zhao
    Luonan Chen
    Kazuyuki Aihara
    Amino Acids, 2008, 35