Understanding protein dispensability through machine-learning analysis of high-throughput data

Cited by: 71
Authors
Chen, Y
Xu, D [1]
Affiliations
[1] UT ORNL, Grad Sch Genome Sci & Technol, Oak Ridge, TN 37830 USA
[2] Univ Missouri, Dept Comp Sci, Digital Biol Lab, Columbia, MO USA
DOI
10.1093/bioinformatics/bti058
Chinese Library Classification
Q5 [Biochemistry]
Subject classification codes
071010; 081704
Abstract
Motivation: Protein dispensability is fundamental to the understanding of gene function and evolution. Recent advances in generating high-throughput data, such as genomic sequence data, protein-protein interaction data, gene-expression data and growth-rate data of mutants, allow us to investigate protein dispensability systematically at the genome scale. Results: In our studies, protein dispensability is represented as a fitness score that is measured by the growth rate of gene-deletion mutants. By analyzing high-throughput data from the yeast Saccharomyces cerevisiae, we found that a protein's dispensability correlated significantly with its evolutionary rate and duplication rate, as well as with its connectivity in the protein-protein interaction network and the gene-expression correlation network. Neural networks and support vector machines were applied to predict protein dispensability from high-throughput data. Our studies shed some light on the global characteristics of protein dispensability and evolution.
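The kind of correlation analysis described in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the abstract does not say which correlation statistic was used, so Spearman rank correlation is an assumption here, and the protein names, interaction edges and fitness scores below are hypothetical toy values chosen only to make the example runnable.

```python
from collections import Counter


def degrees(edges):
    """Count interaction partners (connectivity) for each protein."""
    deg = Counter()
    for a, b in edges:
        deg[a] += 1
        deg[b] += 1
    return deg


def ranks(values):
    """Return 1-based ranks, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


# Hypothetical toy data (not from the paper).
edges = [("A", "B"), ("A", "C"), ("A", "D"), ("B", "C"), ("D", "E")]
fitness = {"A": 0.2, "B": 0.7, "C": 0.6, "D": 0.8, "E": 1.0}

deg = degrees(edges)
proteins = sorted(fitness)
rho = spearman([deg[p] for p in proteins], [fitness[p] for p in proteins])
print(f"Spearman correlation between connectivity and fitness: {rho:.3f}")
```

On real data, the edge list would come from a protein-protein interaction data set and the fitness scores from growth-rate measurements of gene-deletion mutants, and a significance test would be needed before drawing any conclusion from the coefficient.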
Pages: 575-581
Number of pages: 7