Combining multiple clusterings for protein structure prediction

被引:6
|
作者
Sakar, C. Okan [1 ]
Kursun, Olcay [2 ]
Seker, Huseyin [3 ]
Gurgen, Fikret [4 ]
机构
[1] Bahcesehir Univ, Dept Comp Engn, Istanbul, Turkey
[2] Istanbul Univ, Dept Comp Engn, Istanbul, Turkey
[3] De Montfort Univ, Biohlth Informat Res Grp, Leicester LE1 9BH, Leics, England
[4] Bogazici Univ, Dept Comp Engn, Istanbul, Turkey
关键词
cluster ensembles; protein structure prediction; view selection; robust clustering; mutual information; bioinformatics; FEATURE-SELECTION; WEB-SERVER;
D O I
10.1504/IJDMB.2014.064012
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Computational annotation and prediction of protein structure is very important in the post-genome era due to existence of many different proteins, most of which are yet to be verified. Mutual information based feature selection methods can be used in selecting such minimal yet predictive subsets of features. However, as protein features are organised into natural partitions, individual feature selection that ignores the presence of these views, dismantles them, and treats their variables intermixed along with those of others at best results in a complex un-interpretable predictive system for such multi-view datasets. In this paper, instead of selecting a subset of individual features, each feature subset is passed through a clustering step so that it is represented in discrete form using the cluster indices; this makes mutual information based methods applicable to view-selection. We present our experimental results on a multi-view protein dataset that are used to predict protein structure.
引用
收藏
页码:162 / 174
页数:13
相关论文
共 50 条
  • [31] An efficient and scalable family of algorithms for combining clusterings
    Mimaroglu, Selim
    Erdil, Ertunc
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2013, 26 (10) : 2525 - 2539
  • [32] Finding multiple stable clusterings
    Hu, Juhua
    Qian, Qi
    Pei, Jian
    Jin, Rong
    Zhu, Shenghuo
    KNOWLEDGE AND INFORMATION SYSTEMS, 2017, 51 (03) : 991 - 1021
  • [33] A Low Dimensional Embedding Method for Combining Clusterings
    Xu Sen
    Zhou Tian
    Yu Hualong
    ADVANCED MANUFACTURING SYSTEMS, PTS 1-3, 2011, 201-203 : 2517 - +
  • [34] Learning Multiple Nonredundant Clusterings
    Cui, Ying
    Fern, Xiaoli Z.
    Dy, Jennifer G.
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2010, 4 (03)
  • [35] Multiple Independent Subspace Clusterings
    Wang, Xing
    Wang, Jun
    Domeniconi, Carlotta
    Yu, Guoxian
    Xiao, Guoqiang
    Guo, Maozu
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 5353 - 5360
  • [36] Finding Multiple Stable Clusterings
    Hu, Juhua
    Qian, Qi
    Pei, Jian
    Jin, Rong
    Zhu, Shenghuo
    2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2015, : 171 - 180
  • [37] Combining Data Clusterings with Instance Level Constraints
    Duarte, Joao M. M.
    Fred, Ana L. N.
    Duarte, F. Jorge F.
    PATTERN RECOGNITION IN INFORMATION SYSTEMS, PROCEEDINGS, 2009, : 49 - +
  • [38] Combining multiple clusterings via crowd agreement estimation and multi-granularity link analysis
    Huang, Dong
    Lai, Jian-Huang
    Wang, Chang-Dong
    NEUROCOMPUTING, 2015, 170 : 240 - 250
  • [39] Combining Multiple Clusterings of Chemical Structures Using Cumulative Voting-Based Aggregation Algorithm
    Saeed, Faisal
    Salim, Naomie
    Abdo, Ammar
    Hentabli, Hamza
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS (ACIIDS 2013), PT II, 2013, 7803 : 178 - 185
  • [40] Finding multiple stable clusterings
    Juhua Hu
    Qi Qian
    Jian Pei
    Rong Jin
    Shenghuo Zhu
    Knowledge and Information Systems, 2017, 51 : 991 - 1021