A conditional random fields method for RNA sequence-structure relationship modeling and conformation sampling

被引:10
|
作者
Wang, Zhiyong [1 ]
Xu, Jinbo [1 ]
机构
[1] Toyota Technol Inst, Chicago, IL USA
基金
美国国家科学基金会;
关键词
SECONDARY STRUCTURE PREDICTION; TERTIARY STRUCTURES; NMR-SPECTROSCOPY; PROTEIN; MINIMIZATION; PSEUDOKNOTS; ALGORITHMS; COMPLEXITY; MOLECULES; SERVER;
D O I
10.1093/bioinformatics/btr232
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Accurate tertiary structures are very important for the functional study of non-coding RNA molecules. However, predicting RNA tertiary structures is extremely challenging, because of a large conformation space to be explored and lack of an accurate scoring function differentiating the native structure from decoys. The fragment-based conformation sampling method (e.g. FARNA) bears shortcomings that the limited size of a fragment library makes it infeasible to represent all possible conformations well. A recent dynamic Bayesian network method, BARNACLE, overcomes the issue of fragment assembly. In addition, neither of these methods makes use of sequence information in sampling conformations. Here, we present a new probabilistic graphical model, conditional random fields (CRFs), to model RNA sequence-structure relationship, which enables us to accurately estimate the probability of an RNA conformation from sequence. Coupled with a novel tree-guided sampling scheme, our CRF model is then applied to RNA conformation sampling. Experimental results show that our CRF method can model RNA sequence-structure relationship well and sequence information is important for conformation sampling. Our method, named as TreeFolder, generates a much higher percentage of native-like decoys than FARNA and BARNACLE, although we use the same simple energy function as BARNACLE.
引用
收藏
页码:I102 / I110
页数:9
相关论文
共 50 条
  • [31] Training for Gibbs Sampling on Conditional Random Fields with Neural Scoring Factors
    Gao, Sida
    Gormley, Matthew R.
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 4999 - 5011
  • [32] RNA-Pareto: interactive analysis of Pareto-optimal RNA sequence-structure alignments
    Schnattinger, Thomas
    Schoening, Uwe
    Marchfelder, Anita
    Kestler, Hans A.
    [J]. BIOINFORMATICS, 2013, 29 (23) : 3102 - 3104
  • [33] RNANetMotif: Identifying sequence-structure RNA network motifs in RNA-protein binding sites
    Ma, Hongli
    Wen, Han
    Xue, Zhiyuan
    Li, Guojun
    Zhang, Zhaolei
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (07)
  • [34] CONDITIONAL SAMPLING FOR SPECTRALLY DISCRETE MAX-STABLE RANDOM FIELDS
    Wang, Yizao
    Stoev, Stilian A.
    [J]. ADVANCES IN APPLIED PROBABILITY, 2011, 43 (02) : 461 - 483
  • [35] Diagram structure recognition by Bayesian conditional random fields
    Qi, Y
    Szummer, M
    Minka, TP
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2005, : 191 - 196
  • [36] A random sampling method for radiobiological modeling
    Ma, Glenn C.
    [J]. RADIATION PHYSICS AND CHEMISTRY, 2024, 221
  • [37] Accurate multiple sequence-structure alignment of RNA sequences using combinatorial optimization
    Bauer, Markus
    Klau, Gunnar W.
    Reinert, Knut
    [J]. BMC BIOINFORMATICS, 2007, 8 (1)
  • [38] LaRA 2: parallel and vectorized program for sequence-structure alignment of RNA sequences
    Winkler, Jorg
    Urgese, Gianvito
    Ficarra, Elisa
    Reinert, Knut
    [J]. BMC BIOINFORMATICS, 2022, 23 (01)
  • [39] Structator: fast index-based search for RNA sequence-structure patterns
    Meyer, Fernando
    Kurtz, Stefan
    Backofen, Rolf
    Will, Sebastian
    Beckstette, Michael
    [J]. BMC BIOINFORMATICS, 2011, 12
  • [40] AN EFFICIENT PANSHARPENING METHOD BASED ON CONDITIONAL RANDOM FIELDS
    Yang, Yong
    Lu, Hangyuan
    Huang, Shuying
    Tu, Wei
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,