Sequence-structure patterns: Discovery and applications

Cited by: 0

Authors:

Milledge, T [1]

Khuri, S [1]

Wei, X [1]

Yang, C [1]

Zheng, G [1]

Narasimhan, G [1]

Affiliations:

[1] Florida Int Univ, Sch Comp Sci, BioRG, Miami, FL 33199 USA

Source:

Proceedings of the 8th Joint Conference on Information Sciences, Vols 1-3 | 2005

Keywords:

pattern discovery; sequence alignment; structure alignment; sequence-structure patterns;

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Protein sequence data is being generated at a tremendous rate; however, functional annotation of these proteins is proceeding at a much slower pace. Biologists rely on computational biology and pattern recognition to predict protein function. This approach rests on the observation that proteins sharing a similar function often exhibit conserved sequence patterns. Such sequence patterns, or motifs, are derived from multiple sequence alignments and have been collected in databases such as PROSITE, PRINTS, SPAT, and eMOTIF. These patterns help to classify proteins into families whose exact function may or may not be known. Research has shown that these domain signatures often exhibit specific three-dimensional structures. In this paper, we show how, starting from a seed sequence pattern taken from any of the existing sequence pattern databases and using information from the protein structure databases, it is possible to design biologically meaningful sequence-structure patterns (SSPs). An important by-product of our method for generating SSPs is an improved sequence alignment, as well as an improved structural alignment, of the proteins that belong to a family and contain that pattern. Validation was performed by matching the resulting SSPs to domains in the ASTRAL compendium associated with a family or superfamily designation in the SCOP database. SSPs generated by this method were frequently fully specific (no false positives), fully sensitive (no false negatives), or both (diagnostic).
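The validation terms used in the abstract (fully specific, fully sensitive, diagnostic) can be illustrated with a minimal sketch. It covers only the sequence half of the idea: a PROSITE-syntax seed pattern is translated to a regular expression and scanned against labelled sequences. The structural component of an SSP is not modelled here. The pattern, the sequences, and the function names `prosite_to_regex` and `evaluate` are hypothetical and chosen for illustration; they are not taken from the paper, PROSITE, ASTRAL, or SCOP.

```python
import re


def prosite_to_regex(pattern: str) -> str:
    """Translate a PROSITE-syntax pattern into a Python regular expression."""
    parts = []
    for element in pattern.rstrip(".").split("-"):
        # '<' anchors the pattern at the N-terminus, '>' at the C-terminus.
        prefix = "^" if element.startswith("<") else ""
        suffix = "$" if element.endswith(">") else ""
        element = element.strip("<>")
        # Split an element such as "x(2,4)" into residue "x" and count "2,4".
        m = re.fullmatch(r"(.+?)(?:\((\d+(?:,\d+)?)\))?", element)
        residue, count = m.group(1), m.group(2)
        if residue == "x":                      # x = any residue
            residue = "."
        elif residue.startswith("{"):           # {P} = any residue except P
            residue = "[^" + residue[1:-1] + "]"
        repeat = "{" + count + "}" if count else ""
        parts.append(prefix + residue + repeat + suffix)
    return "".join(parts)


def evaluate(pattern, members, non_members):
    """Label a pattern by its false positives and false negatives."""
    regex = re.compile(prosite_to_regex(pattern))
    false_neg = [n for n, seq in members.items() if not regex.search(seq)]
    false_pos = [n for n, seq in non_members.items() if regex.search(seq)]
    if not false_pos and not false_neg:
        label = "diagnostic"
    elif not false_pos:
        label = "fully specific"
    elif not false_neg:
        label = "fully sensitive"
    else:
        label = "neither"
    return label, false_pos, false_neg


# Hypothetical seed pattern and toy sequences (not real database entries).
SEED = "C-x(2,4)-C-x(3)-[LIVM]-x(2)-H"
FAMILY = {"fam_1": "MKCAACGGGLTTHK", "fam_2": "CPDGSCKRELQAHW"}
OTHERS = {"other_1": "MSTNPKPQRKTKRNT", "other_2": "GGCAACGGGPTTHKK"}

if __name__ == "__main__":
    print(prosite_to_regex(SEED))
    print(evaluate(SEED, FAMILY, OTHERS))
```

In this toy setting the family and non-family labels play the role that SCOP family or superfamily designations play in the validation described above; a real evaluation would draw both sets from the ASTRAL domains rather than from hand-written strings.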


Pages: 1282-1285

Page count: 4

