Combining multiple clusterings for protein structure prediction

被引:6
|
作者
Sakar, C. Okan [1 ]
Kursun, Olcay [2 ]
Seker, Huseyin [3 ]
Gurgen, Fikret [4 ]
机构
[1] Bahcesehir Univ, Dept Comp Engn, Istanbul, Turkey
[2] Istanbul Univ, Dept Comp Engn, Istanbul, Turkey
[3] De Montfort Univ, Biohlth Informat Res Grp, Leicester LE1 9BH, Leics, England
[4] Bogazici Univ, Dept Comp Engn, Istanbul, Turkey
关键词
cluster ensembles; protein structure prediction; view selection; robust clustering; mutual information; bioinformatics; FEATURE-SELECTION; WEB-SERVER;
D O I
10.1504/IJDMB.2014.064012
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Computational annotation and prediction of protein structure is very important in the post-genome era due to existence of many different proteins, most of which are yet to be verified. Mutual information based feature selection methods can be used in selecting such minimal yet predictive subsets of features. However, as protein features are organised into natural partitions, individual feature selection that ignores the presence of these views, dismantles them, and treats their variables intermixed along with those of others at best results in a complex un-interpretable predictive system for such multi-view datasets. In this paper, instead of selecting a subset of individual features, each feature subset is passed through a clustering step so that it is represented in discrete form using the cluster indices; this makes mutual information based methods applicable to view-selection. We present our experimental results on a multi-view protein dataset that are used to predict protein structure.
引用
收藏
页码:162 / 174
页数:13
相关论文
共 50 条
  • [1] Combining multiple weak clusterings
    Topchy, A
    Jain, AK
    Punch, W
    THIRD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2003, : 331 - 338
  • [2] CLICOM: Cliques for combining multiple clusterings
    Mimaroglu, Selim
    Yagci, Murat
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (02) : 1889 - 1901
  • [3] Combining multiple clusterings by soft correspondence
    Long, B
    Zhang, ZF
    Yu, PS
    FIFTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2005, : 282 - 289
  • [4] On combining multiple clusterings: an overview and a new perspective
    Tao Li
    Mitsunori Ogihara
    Sheng Ma
    Applied Intelligence, 2010, 33 : 207 - 219
  • [5] On combining multiple clusterings: an overview and a new perspective
    Li, Tao
    Ogihara, Mitsunori
    Ma, Sheng
    APPLIED INTELLIGENCE, 2010, 33 (02) : 207 - 219
  • [6] Combining multiple clusterings using evidence accumulation
    Fred, ALN
    Jain, AK
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (06) : 835 - 850
  • [7] Developments in Partitioning XML Documents by Content and Structure based on Combining Multiple Clusterings
    Costa, Gianni
    Ortale, Riccardo
    2013 IEEE 25TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2013, : 477 - 482
  • [8] Combining multiple clusterings using similarity graph
    Mimaroglu, Selim
    Erdil, Ertunc
    PATTERN RECOGNITION, 2011, 44 (03) : 694 - 703
  • [9] Improved performance in protein secondary structure prediction by combining multiple predictions
    Huang, De-Shuang
    Huang, Xin
    PROTEIN AND PEPTIDE LETTERS, 2006, 13 (10): : 985 - 991
  • [10] Consensus Methods for Combining Multiple Clusterings of Chemical Structures
    Saeed, Faisal
    Salim, Naomie
    Abdo, Ammar
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2013, 53 (05) : 1026 - 1034