Protein sequence similarity searches using patterns as seeds

Cited by: 236

Authors:

Zhang, Z

Schaffer, AA

Miller, W

Madden, TL

Lipman, DJ

Koonin, EV

Altschul, SF [1]

Affiliations:

[1] NIH, Natl Ctr Biotechnol Informat, Natl Lib Med, Bethesda, MD 20894 USA

[2] Penn State Univ, Dept Comp Sci & Engn, University Pk, PA 16802 USA

[3] Natl Human Genome Res Inst, Inherited Dis Res Branch, NIH, Baltimore, MD 21224 USA

Source:

NUCLEIC ACIDS RESEARCH | 1998 / Vol. 26 / Issue 17

Keywords:

DOI:

10.1093/nar/26.17.3986

Chinese Library Classification:

Q5 [Biochemistry]; Q7 [Molecular Biology];

Discipline classification codes:

071010 ; 081704 ;

Abstract:

Protein families are often characterized by conserved sequence patterns or motifs. A researcher frequently wishes to evaluate the significance of a specific pattern within a protein, or to exploit knowledge of known motifs to aid the recognition of greatly diverged but homologous family members. To assist in these efforts, the pattern-hit initiated BLAST (PHI-BLAST) program described here takes as input both a protein sequence and a pattern of interest that it contains. PHI-BLAST searches a protein database for other instances of the input pattern, and uses those found as seeds for the construction of local alignments to the query sequence. The random distribution of PHI-BLAST alignment scores is studied analytically and empirically. In many instances, the program is able to detect statistically significant similarity between homologous proteins that are not recognizably related using traditional single-pass database search methods. PHI-BLAST is applied to the analysis of CED4-like cell death regulators, HS90-type ATPase domains, archaeal tRNA nucleotidyltransferases and archaeal homologs of DnaG-type DNA primases.
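
The seeding idea in the abstract can be illustrated with a minimal Python sketch. It is not the PHI-BLAST implementation: it assumes the pattern is written in a PROSITE-like syntax, translates it to a regular expression, and pairs each pattern occurrence in the query with each occurrence in a database sequence. The function names (`prosite_to_regex`, `find_seeds`), the toy sequences, and the choice of pattern are all hypothetical, and the alignment extension and score statistics described in the paper are omitted.

```python
import re


def prosite_to_regex(pattern: str) -> str:
    """Translate a PROSITE-like pattern into a Python regular expression.

    Supported elements (an assumption of this sketch): fixed residues,
    x (any residue), [ABC] (one of), {ABC} (none of), and repeat counts
    written as (n) or (n,m).
    """
    parts = []
    for element in pattern.strip(".").split("-"):
        # Split off an optional repeat count such as "(2)" or "(2,4)".
        m = re.fullmatch(r"(.+?)(?:\((\d+)(?:,(\d+))?\))?", element)
        core, lo, hi = m.group(1), m.group(2), m.group(3)
        if core == "x":
            token = "."                       # any residue
        elif core.startswith("[") and core.endswith("]"):
            token = core                      # one of the listed residues
        elif core.startswith("{") and core.endswith("}"):
            token = "[^" + core[1:-1] + "]"   # any residue except those listed
        else:
            token = re.escape(core)           # a single fixed residue
        if lo and hi:
            token += "{%s,%s}" % (lo, hi)
        elif lo:
            token += "{%s}" % lo
        parts.append(token)
    return "".join(parts)


def find_seeds(query: str, database: dict, pattern: str) -> list:
    """Return (sequence name, query offset, subject offset) for every pairing
    of a pattern occurrence in the query with one in a database sequence."""
    # A lookahead lets finditer report overlapping occurrences as well.
    regex = re.compile("(?=(" + prosite_to_regex(pattern) + "))")
    query_hits = [m.start(1) for m in regex.finditer(query)]
    seeds = []
    for name, sequence in database.items():
        for m in regex.finditer(sequence):
            for q in query_hits:
                seeds.append((name, q, m.start(1)))
    return seeds


if __name__ == "__main__":
    # Hypothetical toy data; the pattern is a P-loop-style motif used
    # purely for illustration.
    toy_pattern = "[AG]-x(4)-G-K-[ST]"
    toy_query = "MSTAGESGSGKSTLLN"
    toy_db = {"db1": "MKAAPTTSGKTWQ", "db2": "MLLNNQQPPRRDD"}
    print(find_seeds(toy_query, toy_db, toy_pattern))
```

Each returned triple marks a position pair from which a local alignment could be extended in both directions; a database sequence with no occurrence of the pattern contributes no seeds and is never aligned.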

Pages: 3986-3990

Page count: 5
