Conformational Sampling Spaces for Homology Modeling: A Missing Data Approach

被引:0
|
作者
Han, Rongsheng [1 ]
Wu, Guoqing [1 ]
Zhang, Meiling [2 ]
机构
[1] North China Elect Power Univ, 2 Beinong Rd, Beijing 102206, Peoples R China
[2] Tianjin Med Univ, Basic Med Coll, Tianjin 300070, Peoples R China
关键词
PROTEIN; ACCURACY;
D O I
暂无
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
The distance weighted k-nearest neighbor (KNN) algorithm is proposed to impute missing values in the protein comparative modeling (CM). These missing values are caused by insertions/ deletions in the multiple structural alignments of the superfamily. Together with the principal component analysis (PCA) method and the anisotropic network model (ANM), evolutionary deformation information and topological information of proteins are extracted to help the construction of low dimensional sampling spaces for the conserved cores of amino acid backbones. Compared to the standard CM, our method utilizes more evolutionary information, and can study more core residues in the space. As a consequence, the sampling spaces are greatly enlarged. The qualities of sampling spaces are evaluated by distances between the query protein and the sampling space. The results of applications to a set of 33 representative and well studied superfamilies show that the accuracies of most of enlarged sampling spaces are below 1 angstrom. This implies that they are good candidates for further applications in protein structural researches.
引用
收藏
页码:544 / 549
页数:6
相关论文
共 50 条
  • [1] An efficient conformational sampling method for homology modeling
    Han, Rongsheng
    Leo-Macias, Alejandra
    Zerbino, Daniel
    Bastolla, Ugo
    Contreras-Moreira, Bruno
    Ortiz, Angel R.
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2008, 71 (01) : 175 - 188
  • [2] Modeling Multistage Sampling of Family Data with Missing Information
    Choi, Yuri Hee
    Briollais, Laurent
    [J]. GENETIC EPIDEMIOLOGY, 2008, 32 (07) : 683 - 683
  • [3] Chat Based Contact Center Modeling System Modeling, Parameter Estimation and Missing Data Sampling
    Enqvist, Per
    Svensson, Goran
    [J]. PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON OPERATIONS RESEARCH AND ENTERPRISE SYSTEMS (ICORES), 2017, : 464 - 469
  • [4] Estimation of Distribution Function Based on Ranked Set Sampling: Missing Data Approach
    Ashour, Samir Kamel
    Abdallah, Mohamed Soliman
    [J]. THAILAND STATISTICIAN, 2020, 18 (01): : 27 - 42
  • [5] Bayesian interval estimation of multiple correlations with missing data: A Gibbs sampling approach
    Gross, AL
    [J]. MULTIVARIATE BEHAVIORAL RESEARCH, 2000, 35 (02) : 201 - 227
  • [6] Explaining System Missing: Missing Data and Experience Sampling Method
    McLean, Derrick C.
    Nakamura, Jeanne
    Csikszentmihalyi, Mihaly
    [J]. SOCIAL PSYCHOLOGICAL AND PERSONALITY SCIENCE, 2017, 8 (04) : 434 - 441
  • [7] Iterated importance sampling in missing data problems
    Celeux, Gilles
    Marin, Jean-Michel
    Robert, Christian P.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 50 (12) : 3386 - 3404
  • [8] Handling Missing Data Problems with Sampling Methods
    Houari, Rima
    Bounceur, Ahcene
    Tari, A-Kamel
    Kechadi, M-Tahar
    [J]. 2014 INTERNATIONAL CONFERENCE ON ADVANCED NETWORKING DISTRIBUTED SYSTEMS AND APPLICATIONS (INDS 2014), 2014, : 99 - 104
  • [9] Random Subspace Sampling for Classification with Missing Data
    Cao, Yun-Hao
    Wu, Jian-Xin
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2024, 39 (02): : 472 - 486
  • [10] Splice variants: A homology modeling approach
    Furnham, N
    Ruffle, S
    Southan, C
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2004, 54 (03) : 596 - 608