Using Sequence Similarity Networks for Visualization of Relationships Across Diverse Protein Superfamilies

被引:334
|
作者
Atkinson, Holly J. [1 ,1 ]
Morris, John H. [2 ]
Ferrin, Thomas E. [1 ,2 ,3 ]
Babbitt, Patricia C. [1 ,2 ,3 ]
机构
[1] Univ Calif San Francisco, Inst Quantitative Biosci, San Francisco, CA 94143 USA
[2] Univ Calif San Francisco, Dept Pharm Chem, San Francisco, CA 94143 USA
[3] Univ Calif San Francisco, Dept Biopharm Sci, San Francisco, CA 94143 USA
来源
PLOS ONE | 2009年 / 4卷 / 02期
关键词
CRYSTAL-STRUCTURE; ALGORITHM; DATABASE; GENERATION; RECEPTORS; CYTOSCAPE; INFERENCE; FAMILIES; TOOLS; CLANS;
D O I
10.1371/journal.pone.0004345
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The dramatic increase in heterogeneous types of biological data-in particular, the abundance of new protein sequences requires fast and user-friendly methods for organizing this information in a way that enables functional inference. The most widely used strategy to link sequence or structure to function, homology-based function prediction, relies on the fundamental assumption that sequence or structural similarity implies functional similarity. New tools that extend this approach are still urgently needed to associate sequence data with biological information in ways that accommodate the real complexity of the problem, while being accessible to experimental as well as computational biologists. To address this, we have examined the application of sequence similarity networks for visualizing functional trends across protein superfamilies from the context of sequence similarity. Using three large groups of homologous proteins of varying types of structural and functional diversity-GPCRs and kinases from humans, and the crotonase superfamily of enzymes-we show that overlaying networks with orthogonal information is a powerful approach for observing functional themes and revealing outliers. In comparison to other primary methods, networks provide both a good representation of group-wise sequence similarity relationships and a strong visual and quantitative correlation with phylogenetic trees, while enabling analysis and visualization of much larger sets of sequences than trees or multiple sequence alignments can easily accommodate. We also define important limitations and caveats in the application of these networks. As a broadly accessible and effective tool for the exploration of protein superfamilies, sequence similarity networks show great potential for generating testable hypotheses about protein structure-function relationships.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] EightyDVec: a method for protein sequence similarity analysis using physicochemical properties of amino acids
    Rout, Ranjeet Kumar
    Umer, Saiyed
    Sheikh, Sabha
    Sindhwani, Sanchit
    Pati, Smitarani
    COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2022, 10 (01): : 3 - 13
  • [42] Prediction of protein-protein interactions using distant conservation of sequence patterns and structure relationships
    Espadaler, J
    Romero-Isart, O
    Jackson, RM
    Oliva, B
    BIOINFORMATICS, 2005, 21 (16) : 3360 - 3368
  • [43] Using heterozygosity to estimate a percentage DNA sequence similarity for environmental species' delimitation across basidiomycete fungi
    Hughes, Karen W.
    Petersen, Ronald H.
    Lickey, Edgar B.
    NEW PHYTOLOGIST, 2009, 182 (04) : 795 - 798
  • [44] Identifying Protein Complexes from PPI Networks Using GO Semantic Similarity
    Wang, Jian
    Xie, Dong
    Lin, Hongfei
    Yang, Zhihao
    Zhang, Yijia
    2011 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM 2011), 2011, : 582 - 585
  • [45] Alignment of PPI Networks Using Semantic Similarity for Conserved Protein Complex Prediction
    Shui, Yong
    Cho, Young-Rae
    IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2016, 15 (04) : 380 - 389
  • [46] Similarity-based link prediction in social networks using latent relationships between the users
    Ahmad Zareie
    Rizos Sakellariou
    Scientific Reports, 10
  • [47] Regions of minimal structural variation among members of protein domain superfamilies: application to remote homology detection and modelling using distant relationships
    Chakrabarti, S
    Sowdhamini, R
    FEBS LETTERS, 2004, 569 (1-3): : 31 - 36
  • [48] Similarity-based link prediction in social networks using latent relationships between the users
    Zareie, Ahmad
    Sakellariou, Rizos
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [49] Exploring ecosystem service issues across diverse knowledge domains using Bayesian Belief Networks
    Haines-Young, Roy
    PROGRESS IN PHYSICAL GEOGRAPHY-EARTH AND ENVIRONMENT, 2011, 35 (05): : 681 - 699
  • [50] A Review On Homology And Thread Modelling Of Protein Sequence Using Protein Similarity Search And Alphafold2 Colab
    Gupta, Chhote Lal Prasad
    Gaur, Shashank
    Soni, Manish Kumar
    JOURNAL OF PHARMACEUTICAL NEGATIVE RESULTS, 2022, 13 : 1110 - 1116