Systematic benchmarking of deep-learning methods for tertiary RNA structure prediction

被引:0
|
作者
Bahai, Akash [1 ]
Kwoh, Chee Keong [2 ]
Mu, Yuguang [1 ]
Li, Yinghui [1 ]
机构
[1] Nanyang Technol Univ, Sch Biol Sci SBS, Singapore, Singapore
[2] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
关键词
3D STRUCTURE; NOVO PREDICTION; PROTEIN; PUZZLES; OPPORTUNITIES; MODEL;
D O I
10.1371/journal.pcbi.1012715
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The 3D structure of RNA critically influences its functionality, and understanding this structure is vital for deciphering RNA biology. Experimental methods for determining RNA structures are labour-intensive, expensive, and time-consuming. Computational approaches have emerged as valuable tools, leveraging physics-based-principles and machine learning to predict RNA structures rapidly. Despite advancements, the accuracy of computational methods remains modest, especially when compared to protein structure prediction. Deep learning methods, while successful in protein structure prediction, have shown some promise for RNA structure prediction as well, but face unique challenges. This study systematically benchmarks state-of-the-art deep learning methods for RNA structure prediction across diverse datasets. Our aim is to identify factors influencing performance variation, such as RNA family diversity, sequence length, RNA type, multiple sequence alignment (MSA) quality, and deep learning model architecture. We show that generally ML-based methods perform much better than non-ML methods on most RNA targets, although the performance difference isn't substantial when working with unseen novel or synthetic RNAs. The quality of the MSA and secondary structure prediction both play an important role and most methods aren't able to predict non-Watson-Crick pairs in the RNAs. Overall among the automated 3D RNA structure prediction methods, DeepFoldRNA has the best prediction followed by DRFold as the second best method. Finally, we also suggest possible mitigations to improve the quality of the prediction for future method development.
引用
收藏
页数:44
相关论文
共 50 条
  • [1] Deep dive into RNA: a systematic literature review on RNA structure prediction using machine learning methods
    Budnik, Michal
    Wawrzyniak, Jakub
    Grala, Lukasz
    Kadzinski, Milosz
    Szostak, Natalia
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (09)
  • [2] Deep learning for RNA structure prediction
    Wang, Jiuming
    Fan, Yimin
    Hong, Liang
    Hu, Zhihang
    Li, Yu
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 2025, 91
  • [3] Deep-learning Prediction Based Molecular Structure Virtual Screening
    Jeon, Yerin
    Lee, Kyu-Hwang
    Lee, Hokyung
    KOREAN CHEMICAL ENGINEERING RESEARCH, 2020, 58 (02): : 230 - 234
  • [4] Deep learning methods in protein structure prediction
    Torrisi, Mirko
    Pollastri, Gianluca
    Le, Quan
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2020, 18 : 1301 - 1310
  • [5] Benchmarking the SHL Recognition Challenge with Classical and Deep-Learning Pipelines
    Wang, Lin
    Gjoreski, Hristijan
    Ciliberto, Mathias
    Mekki, Sami
    Valentin, Stefan
    Roggen, Daniel
    PROCEEDINGS OF THE 2018 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING AND PROCEEDINGS OF THE 2018 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS (UBICOMP/ISWC'18 ADJUNCT), 2018, : 1626 - 1635
  • [6] A Survey on Deep-Learning Methods for Pedestrian Behavior Prediction from the Egocentric View
    Chen, Tina
    Tian, Renran
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 1898 - 1905
  • [7] A Systematic Review of Deep-Learning Methods for Intracranial Aneurysm Detection in CT Angiography
    Bizjak, Ziga
    Spiclin, Ziga
    BIOMEDICINES, 2023, 11 (11)
  • [8] CompaRNA: a server for continuous benchmarking of automated methods for RNA secondary structure prediction
    Puton, Tomasz
    Kozlowski, Lukasz P.
    Rother, Kristian M.
    Bujnicki, Janusz M.
    NUCLEIC ACIDS RESEARCH, 2013, 41 (07) : 4307 - 4323
  • [9] RNA3DB: A structurally-dissimilar dataset split for training and benchmarking deep learning models for RNA structure prediction
    Szikszai, Marcell
    Magnus, Marcin
    Sanghi, Siddhant
    Kadyan, Sachin
    Bouatta, Nazim
    Rivas, Elena
    JOURNAL OF MOLECULAR BIOLOGY, 2024, 436 (17)
  • [10] Computational prediction of RNA tertiary structures using machine learning methods*
    Huang, Bin
    Du, Yuanyang
    Zhang, Shuai
    Li, Wenfei
    Wang, Jun
    Zhang, Jian
    CHINESE PHYSICS B, 2020, 29 (10)