Protein sequence similarity searches using patterns as seeds

Cited by: 236
Authors
Zhang, Z
Schaffer, AA
Miller, W
Madden, TL
Lipman, DJ
Koonin, EV
Altschul, SF [1]
Affiliations
[1] NIH, Natl Ctr Biotechnol Informat, Natl Lib Med, Bethesda, MD 20894 USA
[2] Penn State Univ, Dept Comp Sci & Engn, University Pk, PA 16802 USA
[3] Natl Human Genome Res Inst, Inherited Dis Res Branch, NIH, Baltimore, MD 21224 USA
DOI
10.1093/nar/26.17.3986
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
Protein families are often characterized by conserved sequence patterns or motifs. A researcher frequently wishes to evaluate the significance of a specific pattern within a protein, or to exploit knowledge of known motifs to aid the recognition of greatly diverged but homologous family members. To assist in these efforts, the pattern-hit initiated BLAST (PHI-BLAST) program described here takes as input both a protein sequence and a pattern of interest that it contains. PHI-BLAST searches a protein database for other instances of the input pattern, and uses those found as seeds for the construction of local alignments to the query sequence. The random distribution of PHI-BLAST alignment scores is studied analytically and empirically. In many instances, the program is able to detect statistically significant similarity between homologous proteins that are not recognizably related using traditional single-pass database search methods. PHI-BLAST is applied to the analysis of CED4-like cell death regulators, HS90-type ATPase domains, archaeal tRNA nucleotidyltransferases and archaeal homologs of DnaG-type DNA primases.
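The seeding step described in the abstract (find every database occurrence of the input pattern, then use each occurrence as the anchor for a local alignment) can be illustrated with a minimal Python sketch. This is not the PHI-BLAST implementation: it assumes a small subset of PROSITE-style pattern syntax (`x`, `[...]`, `{...}`, and repeat counts such as `(2)` or `(2,4)`), and the function names `prosite_to_regex` and `find_seeds`, the toy database, and the example pattern are all hypothetical. The alignment extension and the score statistics are not shown.

```python
import re


def prosite_to_regex(pattern: str) -> str:
    """Translate a PROSITE-style pattern (assumed subset) into a Python regex."""
    parts = []
    for element in pattern.split("-"):
        # Separate the residue specification from an optional repeat count.
        m = re.fullmatch(r"([^()]+)(?:\((\d+(?:,\d+)?)\))?", element)
        if m is None:
            raise ValueError(f"unsupported pattern element: {element!r}")
        core, repeat = m.group(1), m.group(2)
        if core == "x":
            token = "."                      # any residue
        elif core.startswith("{") and core.endswith("}"):
            token = "[^" + core[1:-1] + "]"  # any residue except those listed
        else:
            token = core                     # single residue or [set of residues]
        if repeat:
            token += "{" + repeat + "}"
        parts.append(token)
    return "".join(parts)


def find_seeds(pattern: str, database: dict[str, str]) -> list[tuple[str, int, int]]:
    """Return (sequence name, start, end) for each pattern occurrence.

    A lookahead is used so that overlapping occurrences are also reported.
    Each hit would serve as the seed for a local alignment to the query.
    """
    regex = re.compile("(?=(" + prosite_to_regex(pattern) + "))")
    seeds = []
    for name, sequence in database.items():
        for m in regex.finditer(sequence):
            seeds.append((name, m.start(1), m.end(1)))
    return seeds


# Hypothetical toy database and pattern, for illustration only.
toy_database = {"seqA": "MKLAAGTW", "seqB": "MPPGP"}
print(find_seeds("[LIVM]-x(2)-G-{P}", toy_database))  # [('seqA', 2, 7)]
```

In this toy run only `seqA` contains the pattern (residues `LAAGT`, 0-based positions 2 to 6); `seqB` fails at the final `{P}` element because the residue after `G` is `P`.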
Pages: 3986 - 3990
Page count: 5
Related papers
50 in total
  • [41] Distance Threshold Similarity Searches on Spatiotemporal Trajectories using GPGPU
    Gowanlock, Michael
    Casanova, Henri
    [J]. 2014 21ST INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2014
  • [42] Exploring the sequence, function, and evolutionary space of protein superfamilies using sequence similarity networks and phylogenetic reconstructions
    Copp, Janine N.
    Anderson, Dave W.
    Akiva, Eyal
    Babbitt, Patricia C.
    Tokuriki, Nobuhiko
    [J]. NEW APPROACHES FOR FLAVIN CATALYSIS, 2019, 620 : 315 - 347
  • [43] Mining image sequence similarity patterns in brain images
    Pan, Haiwei
    Xie, Xiaoqin
    Wei, Zhang
    Li, Jianzhong
    [J]. PRICAI 2006: TRENDS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4099 : 965 - 969
  • [44] Analysis of protein sequence/structure similarity relationships
    Gan, HH
    Perlow, RA
    Roy, S
    Ko, J
    Wu, M
    Huang, J
    Yan, SX
    Nicoletta, A
    Vafai, J
    Sun, D
    Wang, LH
    Noah, JE
    Pasquali, S
    Schlick, T
    [J]. BIOPHYSICAL JOURNAL, 2002, 83 (05) : 2781 - 2791
  • [45] Similarity searches in computer security
    Smith, SF
    [J]. SAM'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SECURITY AND MANAGEMENT, VOLS 1 AND 2, 2003 : 658 - 662
  • [46] Similarity Searches in Face Databases
    Franco, Annalisa
    Maio, Dario
    [J]. IMAGE ANALYSIS AND PROCESSING - ICIAP 2009, PROCEEDINGS, 2009, 5716 : 443 - 450
  • [47] Identification of potential interaction networks using sequence-based searches for conserved protein-protein interactions or "interologs"
    Matthews, LR
    Vaglio, P
    Reboul, J
    Ge, H
    Davis, BP
    Garrels, J
    Vincent, S
    Vidal, M
    [J]. GENOME RESEARCH, 2001, 11 (12) : 2120 - 2126
  • [48] Using hybrid alignment for iterative sequence database searches
    Li, YH
    Lauria, M
    Bundschuh, R
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2004, 16 (09) : 841 - 853
  • [49] Prediction of Protein Pairs Sharing Common Active Ligands Using Protein Sequence, Structure, and Ligand Similarity
    Chen, Yu-Chen
    Tolber, Robert
    Aronov, Alex M.
    McGaughey, Georgia
    Walters, W. Patrick
    Meireles, Lidio
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2016, 56 (09) : 1734 - 1745
  • [50] iSARST: an integrated SARST web server for rapid protein structural similarity searches
    Lo, Wei-Cheng
    Lee, Che-Yu
    Lee, Chi-Ching
    Lyu, Ping-Chiang
    [J]. NUCLEIC ACIDS RESEARCH, 2009, 37 : W545 - W551