More challenges for machine-learning protein interactions

被引:44
|
作者
Hamp, Tobias [1 ]
Rost, Burkhard [1 ]
机构
[1] Tech Univ Munich, Dept Informat Bioinformat & Computat Biol I12, D-85748 Garching, Germany
关键词
DATABASE;
D O I
10.1093/bioinformatics/btu857
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Machine learning may be the most popular computational tool in molecular biology. Providing sustained performance estimates is challenging. The standard cross-validation protocols usually fail in biology. Park and Marcotte found that even refined protocols fail for protein-protein interactions (PPIs). Results: Here, we sketch additional problems for the prediction of PPIs from sequence alone. First, it not only matters whether proteins A or B of a target interaction A-B are similar to proteins of training interactions (positives), but also whether A or B are similar to proteins of non-interactions (negatives). Second, training on multiple interaction partners per protein did not improve performance for new proteins (not used to train). In contrary, a strictly non-redundant training that ignored good data slightly improved the prediction of difficult cases. Third, which prediction method appears to be best crucially depends on the sequence similarity between the test and the training set, how many true interactions should be found and the expected ratio of negatives to positives. The correct assessment of performance is the most complicated task in the development of prediction methods. Our analyses suggest that PPIs square the challenge for this task.
引用
收藏
页码:1521 / 1525
页数:5
相关论文
共 50 条
  • [1] Machine-learning techniques for the prediction of protein–protein interactions
    Debasree Sarkar
    Sudipto Saha
    [J]. Journal of Biosciences, 2019, 44
  • [2] Machine-learning techniques for the prediction of protein-protein interactions
    Sarkar, Debasree
    Saha, Sudipto
    [J]. JOURNAL OF BIOSCIENCES, 2019, 44 (04)
  • [3] Assessing the druggability of protein-protein interactions by a supervised machine-learning method
    Nobuyoshi Sugaya
    Kazuyoshi Ikeda
    [J]. BMC Bioinformatics, 10
  • [4] Assessing the druggability of protein-protein interactions by a supervised machine-learning method
    Sugaya, Nobuyoshi
    Ikeda, Kazuyoshi
    [J]. BMC BIOINFORMATICS, 2009, 10 : 263
  • [5] Even More Confident predictions with deep machine-learning
    Poggi, Matteo
    Tosi, Fabio
    Mattoccia, Stefano
    [J]. 2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 393 - 401
  • [6] Machine-Learning Implementation in Clinical Anesthesia: Opportunities and Challenges
    Char, Danton S.
    Burgart, Alyssa
    [J]. ANESTHESIA AND ANALGESIA, 2020, 130 (06): : 1709 - 1712
  • [7] Design Challenges on Machine-Learning Enabled Resource Optimization
    Karkazis, Panagiotis
    Uzunidis, Dimitris
    Trakadas, Panagiotis
    Leligou, Helen C. C.
    [J]. IT PROFESSIONAL, 2022, 24 (05) : 69 - 74
  • [8] An Interpretable Machine-Learning Algorithm to Predict Disordered Protein Phase Separation Based on Biophysical Interactions
    Cai, Hao
    Vernon, Robert M.
    Forman-Kay, Julie D.
    [J]. BIOMOLECULES, 2022, 12 (08)
  • [9] Less is more: a new machine-learning methodology for spatiotemporal systems
    Sihan Feng
    Kang Wang
    Fuming Wang
    Yong Zhang
    Hong Zhao
    [J]. Communications in Theoretical Physics, 2022, 74 (05) : 118 - 124
  • [10] Less is more: a new machine-learning methodology for spatiotemporal systems
    Feng, Sihan
    Wang, Kang
    Wang, Fuming
    Zhang, Yong
    Zhao, Hong
    [J]. COMMUNICATIONS IN THEORETICAL PHYSICS, 2022, 74 (05)