Spectral clustering of protein sequences

被引:0
|
作者
Paccanaro, A [1 ]
Chennubhotla, C [1 ]
Casbon, JA [1 ]
Saqi, MAS [1 ]
机构
[1] Univ London Queen Mary & Westfield Coll, Dept Med Microbiol, Bioinformat Unit, London E1 4NS, England
来源
PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4 | 2003年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A major challenge in bioinformatics is the grouping together of protein sequences into functionally similar families. Large scale clustering of protein sequences may help to identify novel relationships and may also be of use in structural genomics. This paper explores the use of graph-theoretic spectral methods for clustering protein sequences. Using the leading eigenvectors of a matrix derived from similarity information between protein sequences we were able to obtain meaningful clusters on quite diverse sets of proteins. The results presented here show how this method is often able to identify correctly the superfamilies to which the sequences belong.
引用
收藏
页码:3083 / 3088
页数:6
相关论文
共 50 条
  • [41] Min-sum Clustering of Protein Sequences with Limited Distance Information
    Voevodski, Konstantin
    Balcan, Maria-Florina
    Roeglin, Heiko
    Teng, Shang-Hua
    Xia, Yu
    SIMILARITY-BASED PATTERN RECOGNITION, 2011, 7005 : 192 - +
  • [42] Spectral Sparsification in Spectral Clustering
    Chakeri, Alireza
    Farhidzadeh, Hamidreza
    Hall, Lawrence O.
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 2301 - 2306
  • [43] Spectral Sequences
    Penner, Robert
    TOPOLOGY AND K-THEORY: LECTURES BY DANIEL QUILLEN, 2020, 2262 : 113 - 118
  • [44] CLUSTERING CDNA SEQUENCES
    PARSONS, JD
    BRENNER, S
    BISHOP, MJ
    COMPUTER APPLICATIONS IN THE BIOSCIENCES, 1992, 8 (05): : 461 - 466
  • [45] Clustering sequences by overlap
    Dorr, Dietmar H.
    Denton, Anne M.
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2009, 3 (03) : 260 - 279
  • [46] Isofunctional Protein Subfamily Detection Using Data Integration and Spectral Clustering
    de Lima, Elisa Boari
    Meira Junior, Wagner
    de Melo-Minardi, Raquel Cardoso
    PLOS COMPUTATIONAL BIOLOGY, 2016, 12 (06)
  • [47] Clustering Protein Sequences Using Affinity Propagation Based on an Improved Similarity Measure
    Yang, Fan
    Zhu, Qing-Xin
    Tang, Dong-Ming
    Zhao, Ming-Yuan
    EVOLUTIONARY BIOINFORMATICS, 2009, 5 : 137 - 146
  • [48] ProClust:: improved clustering of protein sequences with an extended graph-based approach
    Pipenbacher, P
    Schliep, A
    Schneckener, S
    Schönhuth, A
    Schomburg, D
    Schrader, R
    BIOINFORMATICS, 2002, 18 : S182 - S191
  • [49] Protein structure abstraction and automatic clustering using secondary structure element sequences
    Park, SH
    Park, CY
    Kim, DH
    Park, SH
    Sim, JS
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2005, PT 2, 2005, 3481 : 1284 - 1292
  • [50] Module Detection for Bacteria Based on Spectral Clustering of Protein-Protein Functional Association Networks
    Wu, Hongwei
    Lin, Yaming
    Chan, Fun Choi
    Alba-Flores, Rocio
    2011 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS, 2011, : 465 - 472