NEW MDS AND CLUSTERING BASED ALGORITHMS FOR PROTEIN MODEL QUALITY ASSESSMENT AND SELECTION

Cited by: 3
Authors
Wang, Qingguo [1]
Shang, Charles [2]
Xu, Dong [3]
Shang, Yi [3]
Affiliations
[1] Vanderbilt Univ, Bioinformat & Syst Med Lab, Nashville, TN 37203 USA
[2] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
[3] Univ Missouri, Dept Comp Sci, Columbia, MO 65211 USA
Keywords
Protein tertiary structure prediction; model quality assessment; consensus method; clustering; multidimensional scaling; structure predictions; energy
DOI
10.1142/S0218213013600063
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In protein tertiary structure prediction, assessing the quality of predicted models is an essential task. In recent years, many methods have been proposed for the protein model quality assessment (QA) and selection problem. Despite significant advances, the discerning power of current methods is still unsatisfactory. In this paper, we propose two new algorithms, CC-Select and MDS-QA, based on multidimensional scaling and k-means clustering. For the model selection problem, CC-Select combines consensus with clustering techniques to select the best models from a given pool. Given a set of predicted models, CC-Select first calculates a consensus score for each structure based on its average pairwise structural similarity to the other models. Then, similar structures are grouped into clusters using multidimensional scaling and clustering algorithms. In each cluster, the model with the highest consensus score is selected as a candidate model. For the QA problem, MDS-QA combines single-model scoring functions with consensus to determine a more accurate assessment score for every model in a given pool. Using extensive benchmark sets drawn from a large collection of predicted models, we compare the two algorithms with existing state-of-the-art quality assessment methods and show significant improvement.
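The three CC-Select steps described in the abstract (consensus scoring, MDS plus clustering, per-cluster selection) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy similarity matrix, the conversion of similarity to distance as `1 - sim`, the use of classical (Torgerson) MDS, and the plain Lloyd's k-means below are all assumptions made for illustration. In practice the pairwise similarity would come from a structural measure such as GDT-TS or TM-score, which the abstract does not specify.

```python
import numpy as np

def consensus_scores(sim):
    """Average pairwise similarity of each model to all *other* models."""
    n = sim.shape[0]
    return (sim.sum(axis=1) - np.diag(sim)) / (n - 1)

def classical_mds(dist, dim=2):
    """Classical (Torgerson) MDS: embed a distance matrix in `dim` dimensions."""
    n = dist.shape[0]
    centering = np.eye(n) - np.ones((n, n)) / n
    gram = -0.5 * centering @ (dist ** 2) @ centering
    vals, vecs = np.linalg.eigh(gram)
    top = np.argsort(vals)[::-1][:dim]
    # Clip small negative eigenvalues that arise from non-Euclidean distances.
    return vecs[:, top] * np.sqrt(np.clip(vals[top], 0.0, None))

def kmeans(points, k, iters=100, seed=0):
    """Plain Lloyd's k-means; returns a cluster label for every point."""
    rng = np.random.default_rng(seed)
    centers = points[rng.choice(len(points), size=k, replace=False)]
    for _ in range(iters):
        dists = np.linalg.norm(points[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        new_centers = np.array([
            points[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
            for j in range(k)
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return labels

def cc_select_sketch(sim, k=2, dim=2, seed=0):
    """Pick, in each cluster, the model with the highest consensus score."""
    scores = consensus_scores(sim)
    coords = classical_mds(1.0 - sim, dim)   # assumption: distance = 1 - similarity
    labels = kmeans(coords, k, seed=seed)
    picks = []
    for j in np.unique(labels):
        members = np.flatnonzero(labels == j)
        picks.append(int(members[scores[members].argmax()]))
    # Report candidates from highest to lowest consensus score.
    return sorted(picks, key=lambda i: -scores[i])

# Hypothetical pool of five models forming two structural groups: {0,1,2} and {3,4}.
sim = np.array([
    [1.0, 0.9, 0.8, 0.3, 0.2],
    [0.9, 1.0, 0.9, 0.3, 0.2],
    [0.8, 0.9, 1.0, 0.3, 0.2],
    [0.3, 0.3, 0.3, 1.0, 0.8],
    [0.2, 0.2, 0.2, 0.8, 1.0],
])
candidates = cc_select_sketch(sim, k=2)
print("candidate models:", candidates)
```

A pure consensus ranking would return only models from the largest group; clustering first lets the smaller group contribute a candidate as well, which is the motivation the abstract gives for combining the two techniques.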
Pages: 19