UFold: fast and accurate RNA secondary structure prediction with deep learning

被引:74
|
作者
Fu, Laiyi [1 ,2 ]
Cao, Yingxin [2 ,5 ,6 ]
Wu, Jie [3 ]
Peng, Qinke [1 ]
Nie, Qing [4 ,5 ,6 ]
Xie, Xiaohui [2 ]
机构
[1] Xi An Jiao Tong Univ, Sch Elect & Informat Engn, Syst Engn Inst, Xian 710049, Shaanxi, Peoples R China
[2] Univ Calif Irvine, Dept Comp Sci, Irvine, CA 92697 USA
[3] Univ Calif Irvine, Dept Biol Chem, Irvine, CA 92697 USA
[4] Univ Calif Irvine, Dept Math, Irvine, CA 92697 USA
[5] Univ Calif Irvine, Ctr Complex Biol Syst, Irvine, CA 92697 USA
[6] Univ Calif Irvine, NSF Simons Ctr Multiscale Cell Fate Res, Irvine, CA 92697 USA
关键词
WEB SERVER; PROTEIN; DESIGN;
D O I
10.1093/nar/gkab1074
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
For many RNA molecules, the secondary structure is essential for the correct function of the RNA. Predicting RNA secondary structure from nucleotide sequences is a long-standing problem in genomics, but the prediction performance has reached a plateau over time. Traditional RNA secondary structure prediction algorithms are primarily based on thermodynamic models through free energy minimization, which imposes strong prior assumptions and is slow to run. Here, we propose a deep learning-based method, called UFold, for RNA secondary structure prediction, trained directly on annotated data and base-pairing rules. UFold proposes a novel image-like representation of RNA sequences, which can be efficiently processed by Fully Convolutional Networks (FCNs). We benchmark the performance of UFold on both within- and cross-family RNA datasets. It significantly outperforms previous methods on within-family datasets, while achieving a similar performance as the traditional methods when trained and tested on distinct RNA families. UFold is also able to predict pseudoknots accurately. Its prediction is fast with an inference time of about 160 ms per sequence up to 1500 bp in length. An online web server running UFold is available at . Code is available at .
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Deep metric learning for accurate protein secondary structure prediction
    Yang, Wei
    Liu, Yang
    Xiao, Chunjing
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 242
  • [2] AliNA - a deep learning program for RNA secondary structure prediction
    Nasaev, Shamsudin S.
    Mukanov, Artem R.
    Kuznetsov, Ivan I.
    Veselovsky, Alexander V.
    [J]. MOLECULAR INFORMATICS, 2023, 42 (12)
  • [3] Caveats to Deep Learning Approaches to RNA Secondary Structure Prediction
    Flamm, Christoph
    Wielach, Julia
    Wolfinger, Michael T.
    Badelt, Stefan
    Lorenz, Ronny
    Hofacker, Ivo L.
    [J]. FRONTIERS IN BIOINFORMATICS, 2022, 2
  • [4] RNA secondary structure prediction using deep learning with thermodynamic integration
    Kengo Sato
    Manato Akiyama
    Yasubumi Sakakibara
    [J]. Nature Communications, 12
  • [5] RNA secondary structure prediction using deep learning with thermodynamic integration
    Sato, Kengo
    Akiyama, Manato
    Sakakibara, Yasubumi
    [J]. NATURE COMMUNICATIONS, 2021, 12 (01)
  • [6] Length-Dependent Deep Learning Model for RNA Secondary Structure Prediction
    Mao, Kangkun
    Wang, Jun
    Xiao, Yi
    [J]. MOLECULES, 2022, 27 (03):
  • [7] Fast and accurate Ab Initio Protein structure prediction using deep learning potentials
    Pearce, Robin
    Li, Yang
    Omenn, Gilbert S.
    Zhang, Yang
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (09)
  • [8] RNA independent fragment partition method based on deep learning for RNA secondary structure prediction
    Zhao, Qi
    Mao, Qian
    Zhao, Zheng
    Yuan, Wenxuan
    He, Qiang
    Sun, Qixuan
    Yao, Yudong
    Fan, Xiaoya
    [J]. SCIENTIFIC REPORTS, 2023, 13 (01)
  • [9] RNA independent fragment partition method based on deep learning for RNA secondary structure prediction
    Qi Zhao
    Qian Mao
    Zheng Zhao
    Wenxuan Yuan
    Qiang He
    Qixuan Sun
    Yudong Yao
    Xiaoya Fan
    [J]. Scientific Reports, 13