Understanding protein dispensability through machine-learning analysis of high-throughput data

被引:71
|
作者
Chen, Y
Xu, D [1 ]
机构
[1] UT ORNL, Grad Sch Genome Sci & Technol, Oak Ridge, TN 37830 USA
[2] Univ Missouri, Dept Comp Sci, Digital Biol Lab, Columbia, MO USA
关键词
D O I
10.1093/bioinformatics/bti058
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Protein dispensability is fundamental to the understanding of gene function and evolution. Recent advances in generating high-throughput data such as genomic sequence data, protein-protein interaction data, gene-expression data and growth-rate data of mutants allow us to investigate protein dispensability systematically at the genome scale. Results: In our studies, protein dispensability is represented as a fitness score that is measured by the growth rate of gene-deletion mutants. By the analyses of high-throughput data in yeast Saccharomyces cerevisiae, we found that a protein's dispensability had significant correlations with its evolutionary rate and duplication rate, as well as its connectivity in protein-protein interaction network and gene-expression correlation network. Neural network and support vector machine were applied to predict protein dispensability through high-throughput data. Our studies shed some lights on global characteristics of protein dispensability and evolution.
引用
收藏
页码:575 / 581
页数:7
相关论文
共 50 条
  • [41] Machine Learning-Driven Data Valuation for Optimizing High-Throughput Screening Pipelines
    Hesse, Joshua
    Boldini, Davide
    Sieber, Stephan A.
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2024, 64 (21) : 8142 - 8152
  • [42] On-the-fly machine-learning for high-throughput experiments: search for rare-earth-free permanent magnets
    Kusne, Aaron Gilad
    Gao, Tieren
    Mehta, Apurva
    Ke, Liqin
    Manh Cuong Nguyen
    Ho, Kai-Ming
    Antropov, Vladimir
    Wang, Cai-Zhuang
    Kramer, Matthew J.
    Long, Christian
    Takeuchi, Ichiro
    SCIENTIFIC REPORTS, 2014, 4
  • [43] A high-throughput architecture for anomaly detection in streaming data using machine learning algorithms
    Surianarayanan C.
    Kunasekaran S.
    Chelliah P.R.
    International Journal of Information Technology, 2024, 16 (1) : 493 - 506
  • [44] On-the-fly machine-learning for high-throughput experiments: search for rare-earth-free permanent magnets
    Aaron Gilad Kusne
    Tieren Gao
    Apurva Mehta
    Liqin Ke
    Manh Cuong Nguyen
    Kai-Ming Ho
    Vladimir Antropov
    Cai-Zhuang Wang
    Matthew J. Kramer
    Christian Long
    Ichiro Takeuchi
    Scientific Reports, 4
  • [45] Improved estimation of stomatal conductance by combining high-throughput plant phenotyping data and weather variables through machine learning
    Zhang, Junxiao
    Thapa, Kantilata
    Bai, Geng
    Ge, Yufeng
    AGRICULTURAL WATER MANAGEMENT, 2025, 309
  • [46] Practical Outcomes of Applying Ensemble Machine Learning Classifiers to High-Throughput Screening (HTS) Data Analysis and Screening
    Simmons, Kirk
    Kinney, John
    Owens, Aaron
    Kleier, Daniel A.
    Bloch, Karen
    Argentar, Dave
    Walsh, Alicia
    Vaidyanathan, Ganesh
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2008, 48 (11) : 2196 - 2206
  • [47] A high throughput machine-learning driven analysis of Ca2+spatio-temporal maps
    Leigh, Wesley
    Del Valle, Guillermo
    Kamran, Sharif Amit
    Drumm, Bernard T.
    Tavakkoli, Alireza
    Sanders, Kenton M.
    Baker, Sal A.
    NEUROGASTROENTEROLOGY AND MOTILITY, 2020, 32
  • [48] High-throughput and data-driven machine learning techniques for discovering high-entropy alloys
    Lu, Zhichao
    Dong, Ma
    Liu, Xiongjun
    Lu, Zhaoping
    COMMUNICATIONS MATERIALS, 2024, 5 (01)
  • [49] Seeing through protein complexes by high-throughput FRET
    Nagy, Peter
    Szoellosi, Janos
    CYTOMETRY PART A, 2008, 73A (05) : 388 - 389
  • [50] High-throughput MS for intact protein analysis
    Liu, Chang
    BIOANALYSIS, 2023, 15 (16)