Understanding protein dispensability through machine-learning analysis of high-throughput data

被引:71
|
作者
Chen, Y
Xu, D [1 ]
机构
[1] UT ORNL, Grad Sch Genome Sci & Technol, Oak Ridge, TN 37830 USA
[2] Univ Missouri, Dept Comp Sci, Digital Biol Lab, Columbia, MO USA
关键词
D O I
10.1093/bioinformatics/bti058
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Protein dispensability is fundamental to the understanding of gene function and evolution. Recent advances in generating high-throughput data such as genomic sequence data, protein-protein interaction data, gene-expression data and growth-rate data of mutants allow us to investigate protein dispensability systematically at the genome scale. Results: In our studies, protein dispensability is represented as a fitness score that is measured by the growth rate of gene-deletion mutants. By the analyses of high-throughput data in yeast Saccharomyces cerevisiae, we found that a protein's dispensability had significant correlations with its evolutionary rate and duplication rate, as well as its connectivity in protein-protein interaction network and gene-expression correlation network. Neural network and support vector machine were applied to predict protein dispensability through high-throughput data. Our studies shed some lights on global characteristics of protein dispensability and evolution.
引用
收藏
页码:575 / 581
页数:7
相关论文
共 50 条
  • [1] Machine-learning models for high-throughput materials discovery
    Landrum, GA
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2003, 225 : U560 - U560
  • [2] Comparative study of machine-learning and chemometric tools for analysis of in-vivo high-throughput screening data
    Simmons, Kirk
    Kinney, John
    Owens, Aaron
    Kleier, Dan
    Bloch, Karen
    Argentar, Dave
    Walsh, Alicia
    Vaidyanathan, Ganesh
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2008, 48 (08) : 1663 - 1668
  • [3] Machine-learning and high-throughput studies for high-entropy materials
    Huang, E-Wen
    Lee, Wen-Jay
    Singh, Sudhanshu Shekhar
    Kumar, Poresh
    Lee, Chih-Yu
    Lam, Tu-Ngoc
    Chin, Hsu-Hsuan
    Lin, Bi-Hsuan
    Liaw, Peter K.
    MATERIALS SCIENCE & ENGINEERING R-REPORTS, 2022, 147
  • [4] High-throughput and machine-learning accelerated design of high entropy alloy catalysts
    Chen, Zhi Wen
    Chen, LiXin
    Gariepy, Zachary
    Yao, Xue
    Singh, Chandra Veer
    TRENDS IN CHEMISTRY, 2022, 4 (07): : 577 - 579
  • [5] DianNao: A Small-Footprint High-Throughput Accelerator for Ubiquitous Machine-Learning
    Chen, Tianshi
    Du, Zidong
    Sun, Ninghui
    Wang, Jia
    Wu, Chengyong
    Chen, Yunji
    Temam, Olivier
    ACM SIGPLAN NOTICES, 2014, 49 (04) : 269 - 283
  • [6] Electrochemical stability of biodegradable Zn-Cu alloys through machine-learning accelerated high-throughput discovery
    Luo, Kun
    Liu, Zhaorui
    Yu, Rui
    Xu, Tengfei
    Legut, Dominik
    Yin, Xing
    Zhang, Ruifeng
    PHYSICAL CHEMISTRY CHEMICAL PHYSICS, 2024, 26 (35) : 23010 - 23022
  • [7] Machine-learning potentials enable predictive and tractable high-throughput screening of random alloys
    Hodapp, M.
    Shapeev, A.
    PHYSICAL REVIEW MATERIALS, 2021, 5 (11)
  • [8] Multidimensional Classification of Catalysts in Oxidative Coupling of Methane through Machine Learning and High-Throughput Data
    Takahashi, Keisuke
    Takahashi, Lauren
    Thanh Nhat Nguyen
    Thakur, Ashutosh
    Taniike, Toshiaki
    JOURNAL OF PHYSICAL CHEMISTRY LETTERS, 2020, 11 (16): : 6819 - 6826
  • [9] Understanding molecular mechanisms with high-throughput analysis and data mining
    Sucularli, Ceren
    INTERNATIONAL JOURNAL OF MOLECULAR MEDICINE, 2016, 38 : S25 - S25
  • [10] Enabling Catalyst Discovery through Machine Learning and High-Throughput Experimentation
    Williams, Travis
    McCullough, Katherine
    Lauterbach, Jochen A.
    CHEMISTRY OF MATERIALS, 2020, 32 (01) : 157 - 165