Combining multiple clusterings for protein structure prediction

被引:6
|
作者
Sakar, C. Okan [1 ]
Kursun, Olcay [2 ]
Seker, Huseyin [3 ]
Gurgen, Fikret [4 ]
机构
[1] Bahcesehir Univ, Dept Comp Engn, Istanbul, Turkey
[2] Istanbul Univ, Dept Comp Engn, Istanbul, Turkey
[3] De Montfort Univ, Biohlth Informat Res Grp, Leicester LE1 9BH, Leics, England
[4] Bogazici Univ, Dept Comp Engn, Istanbul, Turkey
关键词
cluster ensembles; protein structure prediction; view selection; robust clustering; mutual information; bioinformatics; FEATURE-SELECTION; WEB-SERVER;
D O I
10.1504/IJDMB.2014.064012
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Computational annotation and prediction of protein structure is very important in the post-genome era due to existence of many different proteins, most of which are yet to be verified. Mutual information based feature selection methods can be used in selecting such minimal yet predictive subsets of features. However, as protein features are organised into natural partitions, individual feature selection that ignores the presence of these views, dismantles them, and treats their variables intermixed along with those of others at best results in a complex un-interpretable predictive system for such multi-view datasets. In this paper, instead of selecting a subset of individual features, each feature subset is passed through a clustering step so that it is represented in discrete form using the cluster indices; this makes mutual information based methods applicable to view-selection. We present our experimental results on a multi-view protein dataset that are used to predict protein structure.
引用
收藏
页码:162 / 174
页数:13
相关论文
共 50 条
  • [41] Combining hydrophobicity and helicity: A novel approach to membrane protein structure prediction
    Liu, LP
    Deber, CM
    BIOORGANIC & MEDICINAL CHEMISTRY, 1999, 7 (01) : 1 - 7
  • [42] A general method for combining predictors tested on protein secondary structure prediction
    Hansen, JV
    Krogh, A
    ARTIFICIAL NEURAL NETWORKS IN MEDICINE AND BIOLOGY, 2000, : 259 - 264
  • [43] Accurate prediction of protein assembly structure by combining AlphaFold and symmetrical docking
    Jeppesen, Mads
    Andre, Ingemar
    NATURE COMMUNICATIONS, 2023, 14 (01)
  • [44] Accurate prediction of protein assembly structure by combining AlphaFold and symmetrical docking
    Mads Jeppesen
    Ingemar André
    Nature Communications, 14
  • [45] Adaptive Cumulative Voting-Based Aggregation Algorithm for Combining Multiple Clusterings of Chemical Structures
    Saeed, Faisal
    Salim, Naomie
    Abdo, Ammar
    Hentabli, Hamza
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS (ACIIDS 2013), PT II, 2013, 7803 : 305 - 314
  • [46] Multiple Co-Clusterings
    Wang, Xing
    Yu, Guoxian
    Domeniconi, Carlotta
    Wang, Jun
    Yu, Zhiwen
    Zhang, Zili
    2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 1308 - 1313
  • [47] A multiple minima genetic algorithm for protein structure prediction
    Custodio, Fabio Lima
    Barbosa, Helio J. C.
    Dardenne, Laurent Emmanuel
    APPLIED SOFT COMPUTING, 2014, 15 : 88 - 99
  • [48] Multiple linear regression for protein secondary structure prediction
    Pan, XM
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2001, 43 (03) : 256 - 259
  • [49] Multiple Feature Fusion Protein Tertiary Structure Prediction
    Bao, Wenzheng
    Chen, Yuehui
    Chen, Yiming
    PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CLOUD COMPUTING COMPANION (ISCC-C), 2014, : 751 - 756