Accurate Classification of RNA Structures Using Topological Fingerprints

被引:11
|
作者
Huang, Jiajie [1 ,2 ]
Li, Kejie [1 ,3 ]
Gribskov, Michael [1 ,4 ]
机构
[1] Purdue Univ, Dept Biol Sci, W Lafayette, IN 47907 USA
[2] Thermo Fisher Sci, Life Sci Solut Grp, San Francisco, CA USA
[3] Biogen Idec Inc, Computat Biol Dept, Cambridge, MA USA
[4] Purdue Univ, Dept Comp Sci, W Lafayette, IN 47907 USA
来源
PLOS ONE | 2016年 / 11卷 / 10期
基金
美国国家科学基金会;
关键词
SECONDARY STRUCTURE PREDICTION; DYNAMIC-PROGRAMMING ALGORITHM; PROTEIN DATA-BANK; INCLUDING PSEUDOKNOTS; NONCODING RNAS; ABSTRACT SHAPES; RIBONUCLEASE-P; RIBOSOMAL-RNA; SEQUENCE; GRAPHS;
D O I
10.1371/journal.pone.0164726
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
While RNAs are well known to possess complex structures, functionally similar RNAs often have little sequence similarity. While the exact size and spacing of base-paired regions vary, functionally similar RNAs have pronounced similarity in the arrangement, or topology, of base-paired stems. Furthermore, predicted RNA structures often lack pseudoknots (a crucial aspect of biological activity), and are only partially correct, or incomplete. A topological approach addresses all of these difficulties. In this work we describe each RNA structure as a graph that can be converted to a topological spectrum (RNA fingerprint). The set of subgraphs in an RNA structure, its RNA fingerprint, can be compared with the fingerprints of other RNA structures to identify and correctly classify functionally related RNAs. Topologically similar RNAs can be identified even when a large fraction, up to 30%, of the stems are omitted, indicating that highly accurate structures are not necessary. We investigate the performance of the RNA fingerprint approach on a set of eight highly curated RNA families, with diverse sizes and functions, containing pseudoknots, and with little sequence similarity-an especially difficult test set. In spite of the difficult test set, the RNA fingerprint approach is very successful (ROC AUC > 0.95). Due to the inclusion of pseudoknots, the RNA fingerprint approach both covers a wider range of possible structures than methods based only on secondary structure, and its tolerance for incomplete structures suggests that it can be applied even to predicted structures.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Classification and enumeration of topological structures of robotic mechanisms - Generation of robotic mechanisms with unconventional topological structures
    Arikawa, Keisuke
    [J]. PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE 2007, VOL 8, PTS A AND B, 2008, : 1091 - 1100
  • [22] Fingerprints of topological defects in a metasurface
    Kang, Ming
    Li, Yongnan
    Wang, Hui-Tian
    [J]. OPTICS LETTERS, 2014, 39 (16) : 4879 - 4882
  • [23] Topological fingerprints in Liouvillian gaps
    Kavanagh, K.
    Slingerland, J.K.
    Dooley, S.
    Kells, G.
    [J]. Physical Review E, 2024, 110 (05)
  • [24] Classification and predictions of RNA pseudoknots based on topological invariants
    Vernizzi, Graziano
    Orland, Henri
    Zee, A.
    [J]. PHYSICAL REVIEW E, 2016, 94 (04)
  • [25] Modeling RNA topological structures using small angle X-ray scattering
    Bhandari, Yuba R.
    Jiang, Wei
    Stahlberg, Eric A.
    Stagno, Jason R.
    Wang, Yun-Xing
    [J]. METHODS, 2016, 103 : 18 - 24
  • [26] Analysis and classification of RNA tertiary structures
    Abraham, Mira
    Dror, Oranit
    Nussinov, Ruth
    Wolfson, Haim J.
    [J]. RNA, 2008, 14 (11) : 2274 - 2289
  • [27] Fingerprints Selection for Topological Localization
    Popov, Vladimir
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON NUMERICAL ANALYSIS AND APPLIED MATHEMATICS 2016 (ICNAAM-2016), 2017, 1863
  • [28] Exploration of Topological Torsion Fingerprints
    Petr, Skoda
    David, Hoksza
    [J]. PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2015, : 822 - 828
  • [29] IPknot: fast and accurate prediction of RNA secondary structures with pseudoknots using integer programming
    Sato, Kengo
    Kato, Yuki
    Hamada, Michiaki
    Akutsu, Tatsuya
    Asai, Kiyoshi
    [J]. BIOINFORMATICS, 2011, 27 (13) : I85 - I93
  • [30] Classification of Fingerprints Captured Using Optical Coherence Tomography
    Sousedik, Ctirad
    Breithaupt, Ralph
    Bours, Patrick
    [J]. IMAGE ANALYSIS, SCIA 2017, PT II, 2017, 10270 : 326 - 337