A conditional random fields method for RNA sequence-structure relationship modeling and conformation sampling

被引:10
|
作者
Wang, Zhiyong [1 ]
Xu, Jinbo [1 ]
机构
[1] Toyota Technol Inst, Chicago, IL USA
基金
美国国家科学基金会;
关键词
SECONDARY STRUCTURE PREDICTION; TERTIARY STRUCTURES; NMR-SPECTROSCOPY; PROTEIN; MINIMIZATION; PSEUDOKNOTS; ALGORITHMS; COMPLEXITY; MOLECULES; SERVER;
D O I
10.1093/bioinformatics/btr232
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Accurate tertiary structures are very important for the functional study of non-coding RNA molecules. However, predicting RNA tertiary structures is extremely challenging, because of a large conformation space to be explored and lack of an accurate scoring function differentiating the native structure from decoys. The fragment-based conformation sampling method (e.g. FARNA) bears shortcomings that the limited size of a fragment library makes it infeasible to represent all possible conformations well. A recent dynamic Bayesian network method, BARNACLE, overcomes the issue of fragment assembly. In addition, neither of these methods makes use of sequence information in sampling conformations. Here, we present a new probabilistic graphical model, conditional random fields (CRFs), to model RNA sequence-structure relationship, which enables us to accurately estimate the probability of an RNA conformation from sequence. Coupled with a novel tree-guided sampling scheme, our CRF model is then applied to RNA conformation sampling. Experimental results show that our CRF method can model RNA sequence-structure relationship well and sequence information is important for conformation sampling. Our method, named as TreeFolder, generates a much higher percentage of native-like decoys than FARNA and BARNACLE, although we use the same simple energy function as BARNACLE.
引用
收藏
页码:I102 / I110
页数:9
相关论文
共 50 条
  • [1] CRFalign: A Sequence-Structure Alignment of Proteins Based on a Combination of HMM-HMM Comparison and Conditional Random Fields
    Lee, Sung Jong
    Joo, Keehyoung
    Sim, Sangjin
    Lee, Juyong
    Lee, In-Ho
    Lee, Jooyoung
    [J]. MOLECULES, 2022, 27 (12):
  • [2] Sequence-structure relations of pseudoknot RNA
    Huang, Fenix W. D.
    Li, Linda Y. M.
    Reidys, Christian M.
    [J]. BMC BIOINFORMATICS, 2009, 10
  • [3] Sequence-structure relations of pseudoknot RNA
    Fenix WD Huang
    Linda YM Li
    Christian M Reidys
    [J]. BMC Bioinformatics, 10
  • [4] Paramecium: RNA sequence-structure phylogenetics
    Weimer, Marlyn
    Vd'acny, Peter
    Wolf, Matthias
    [J]. INTERNATIONAL JOURNAL OF SYSTEMATIC AND EVOLUTIONARY MICROBIOLOGY, 2023, 73 (04)
  • [5] Sequence-structure relationships in RNA loops: establishing the basis for loop homology modeling
    Schudoma, Christian
    May, Patrick
    Nikiforova, Viktoria
    Walther, Dirk
    [J]. NUCLEIC ACIDS RESEARCH, 2010, 38 (03) : 970 - 980
  • [6] Insertions and deletions in the RNA sequence-structure map
    Martin, Nora S.
    Ahnert, Sebastian E.
    [J]. JOURNAL OF THE ROYAL SOCIETY INTERFACE, 2021, 18 (183)
  • [7] Servers for sequence-structure relationship analysis and prediction
    Dosztányi, Z
    Magyar, C
    Tusnády, GE
    Cserzo, M
    Fiser, A
    Simon, I
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (13) : 3359 - 3363
  • [8] The sequence-structure relationship and protein function prediction
    Sadowski, M. I.
    Jones, D. T.
    [J]. CURRENT OPINION IN STRUCTURAL BIOLOGY, 2009, 19 (03) : 357 - 362
  • [9] Exploring the sequence-structure relationship for amyloid peptides
    Morris, Kyle L.
    Rodger, Alison
    Hicks, Matthew R.
    Debulpaep, Maya
    Schymkowitz, Joost
    Rousseau, Frederic
    Serpell, Louise C.
    [J]. BIOCHEMICAL JOURNAL, 2013, 450 : 275 - 283
  • [10] A Novel Method of Citation Sequence Labeling Based on Conditional Random Fields
    Zhou, Junxian
    Shen, Derong
    Nie, Tiezheng
    Kou, Yue
    Yu, Ge
    [J]. 2013 10TH WEB INFORMATION SYSTEM AND APPLICATION CONFERENCE (WISA 2013), 2013, : 184 - 187