Spectral clustering of protein sequences

被引:0
|
作者
Paccanaro, A [1 ]
Chennubhotla, C [1 ]
Casbon, JA [1 ]
Saqi, MAS [1 ]
机构
[1] Univ London Queen Mary & Westfield Coll, Dept Med Microbiol, Bioinformat Unit, London E1 4NS, England
来源
PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4 | 2003年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A major challenge in bioinformatics is the grouping together of protein sequences into functionally similar families. Large scale clustering of protein sequences may help to identify novel relationships and may also be of use in structural genomics. This paper explores the use of graph-theoretic spectral methods for clustering protein sequences. Using the leading eigenvectors of a matrix derived from similarity information between protein sequences we were able to obtain meaningful clusters on quite diverse sets of proteins. The results presented here show how this method is often able to identify correctly the superfamilies to which the sequences belong.
引用
收藏
页码:3083 / 3088
页数:6
相关论文
共 50 条
  • [31] Visualizing and Clustering Protein Similarity Networks: Sequences, Structures, and Functions
    Mai, Te-Lun
    Hu, Geng-Ming
    Chen, Chi-Ming
    JOURNAL OF PROTEOME RESEARCH, 2016, 15 (07) : 2123 - 2131
  • [32] Spectral clustering for detecting protein complexes in protein-protein interaction (PPI) networks
    Qin, Guimin
    Gao, Lin
    MATHEMATICAL AND COMPUTER MODELLING, 2010, 52 (11-12) : 2066 - 2074
  • [33] An approach to functionally relevant clustering of the protein universe: Active site profile-based clustering of protein structures and sequences
    Knutson, Stacy T.
    Westwood, Brian M.
    Leuthaeuser, Janelle B.
    Turner, Brandon E.
    Nguyendac, Don
    Shea, Gabrielle
    Kumar, Kiran
    Hayden, Julia D.
    Harper, Angela F.
    Brown, Shoshana D.
    Morris, John H.
    Ferrin, Thomas E.
    Babbitt, Patricia C.
    Fetrow, Jacquelyn S.
    PROTEIN SCIENCE, 2017, 26 (04) : 677 - 699
  • [34] Protein Sequences Clustering of Herpes Virus by Using Tribe Markov Clustering (Tribe-MCL)
    Bustamam, A.
    Siswantining, T.
    Febriyani, N. L.
    Novitasari, I. D.
    Cahyaningrum, R. D.
    INTERNATIONAL SYMPOSIUM ON CURRENT PROGRESS IN MATHEMATICS AND SCIENCES 2016 (ISCPMS 2016), 2017, 1862
  • [35] Diffusion Model Based Spectral Clustering for Protein-Protein Interaction Networks
    Inoue, Kentaro
    Li, Weijiang
    Kurata, Hiroyuki
    PLOS ONE, 2010, 5 (09): : 1 - 10
  • [36] A NEW CLUSTERING SYSTEM FOR PROTEIN SEQUENCES AND ITS APPLICATION TO CONSTRAINTS DISCOVERY IN PROTEIN EVOLUTION
    NAKAGAWA, H
    KOYAMA, Y
    KAWAI, T
    OKADA, T
    INTERNATIONAL JOURNAL OF PEPTIDE AND PROTEIN RESEARCH, 1995, 46 (05): : 440 - 451
  • [37] Unsupervised protein sequences clustering algorithm using functional domain information
    Chen, Wei-Bang
    Zhang, Chengcui
    Zhong, Hua
    PROCEEDINGS OF THE 2008 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION, 2008, : 76 - 81
  • [38] Min-sum Clustering of Protein Sequences with Limited Distance Information
    Voevodski, Konstantin
    Balcan, Maria-Florina
    Roeglin, Heiko
    Teng, Shang-Hua
    Xia, Yu
    SIMILARITY-BASED PATTERN RECOGNITION: FIRST INTERNATIONAL WORKSHOP, SIMBAD 2011, 2011, 7005 : 192 - 206
  • [39] Clustering of highly homologous sequences to reduce the size of large protein databases
    Li, WZ
    Jaroszewski, L
    Godzik, A
    BIOINFORMATICS, 2001, 17 (03) : 282 - 283
  • [40] Mining for representative regions of virus genuses via protein sequences clustering
    Wang, Jing-Doo
    Wang, Yi-Chun
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2014, 9 (03) : 321 - 337