Prediction of protein amidation sites by feature selection and analysis

被引:0
|
作者
Weiren Cui
Shen Niu
Lulu Zheng
Lele Hu
Tao Huang
Lei Gu
Kaiyan Feng
Ning Zhang
Yudong Cai
Yixue Li
机构
[1] Institute of Systems Biology,CAS
[2] Shanghai University,MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences
[3] Chinese Academy of Sciences,Department of Mathematics, College of Science
[4] Shanghai University,Key Laboratory of Systems Biology, Shanghai Institutes for Biological Sciences
[5] Chinese Academy of Sciences,Hubei Bioinformatics and Molecular Imaging Key Laboratory
[6] Shanghai Center for Bioinformation Technology,Division of Theoretical Bioinformatics (BO80)
[7] Huazhong University of Science and Technology,Tianjin Key Lab of BME Measurement, Department of Biomedical Engineering
[8] German Cancer Research Center,undefined
[9] Tianjin University,undefined
来源
关键词
Amidation; Maximum relevance minimum redundancy; Incremental feature selection; Nearest neighbor algorithm;
D O I
暂无
中图分类号
学科分类号
摘要
Carboxy-terminal α-amidation is a widespread post-translational modification of proteins found widely in vertebrates and invertebrates. The α-amide group is required for full biological activity, since it may render a peptide more hydrophobic and thus better be able to bind to other proteins, preventing ionization of the C-terminus. However, in particular, the C-terminal amidation is very difficult to detect because experimental methods are often labor-intensive, time-consuming and expensive. Therefore, in silico methods may complement due to their high efficiency. In this study, a computational method was developed to predict protein amidation sites, by incorporating the maximum relevance minimum redundancy method and the incremental feature selection method based on the nearest neighbor algorithm. From a total of 735 features, 41 optimal features were selected and were utilized to construct the final predictor. As a result, the predictor achieved an overall Matthews correlation coefficient of 0.8308. Feature analysis showed that PSSM conservation scores and amino acid factors played the most important roles in the α-amidation site prediction. Site-specific feature analyses showed that features derived from the amidation site itself and adjacent sites were most significant. This method presented could be used as an efficient tool to theoretically predict amidated peptides. And the selected features from our study could shed some light on the in-depth understanding of the mechanisms of the amidation modification, providing guidelines for experimental validation.
引用
收藏
页码:391 / 400
页数:9
相关论文
共 50 条
  • [1] Prediction of protein amidation sites by feature selection and analysis
    Cui, Weiren
    Niu, Shen
    Zheng, Lulu
    Hu, Lele
    Huang, Tao
    Gu, Lei
    Feng, Kaiyan
    Zhang, Ning
    Cai, Yudong
    Li, Yixue
    [J]. MOLECULAR GENETICS AND GENOMICS, 2013, 288 (09) : 391 - 400
  • [2] PrAS: Prediction of amidation sites using multiple feature extraction
    Wang, Tong
    Zheng, Wei
    Wuyun, Qiqige
    Wu, Zhenfeng
    Ruan, Jishou
    Hu, Gang
    Gao, Jianzhao
    [J]. COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2017, 66 : 57 - 62
  • [3] Prediction of Protein Modification Sites of Pyrrolidone Carboxylic Acid Using mRMR Feature Selection and Analysis
    Zheng, Lu-Lu
    Niu, Shen
    Hao, Pei
    Feng, KaiYan
    Cai, Yu-Dong
    Li, Yixue
    [J]. PLOS ONE, 2011, 6 (12):
  • [4] Prediction of Protein Domain with mRMR Feature Selection and Analysis
    Li, Bi-Qing
    Hu, Le-Le
    Chen, Lei
    Feng, Kai-Yan
    Cai, Yu-Dong
    Chou, Kuo-Chen
    [J]. PLOS ONE, 2012, 7 (06):
  • [5] Prediction of Protein Subcellular Locations with Feature Selection and Analysis
    Cai, Yudong
    He, Jianfeng
    Li, Xinlei
    Feng, Kaiyan
    Lu, Lin
    Feng, Kairui
    Kong, Xiangyin
    Lu, Wencong
    [J]. PROTEIN AND PEPTIDE LETTERS, 2010, 17 (04): : 464 - 472
  • [6] Predicting protein oxidation sites with feature selection and analysis approach
    Niu, Shen
    Hu, Le-Le
    Zheng, Lu-Lu
    Huang, Tao
    Feng, Kai-Yan
    Cai, Yu-Dong
    Li, Hai-Peng
    Li, Yi-Xue
    Chou, Kuo-Chen
    [J]. JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2012, 29 (06): : 650 - 658
  • [7] Protein sumoylation sites prediction based on two-stage feature selection
    Lin Lu
    Xiao-He Shi
    Su-Jun Li
    Zhi-Qun Xie
    Yong-Li Feng
    Wen-Cong Lu
    Yi-Xue Li
    Haipeng Li
    Yu-Dong Cai
    [J]. Molecular Diversity, 2010, 14 : 81 - 86
  • [8] Protein sumoylation sites prediction based on two-stage feature selection
    Lu, Lin
    Shi, Xiao-He
    Li, Su-Jun
    Xie, Zhi-Qun
    Feng, Yong-Li
    Lu, Wen-Cong
    Li, Yi-Xue
    Li, Haipeng
    Cai, Yu-Dong
    [J]. MOLECULAR DIVERSITY, 2010, 14 (01) : 81 - 86
  • [9] Feature Selection for the Prediction of Translation Initiation Sites
    Guo-Liang Li* and Tze-Yun Leong Medical Computing Laboratory
    [J]. Genomics,Proteomics & Bioinformatics, 2005, (02) : 73 - 83
  • [10] Computational Prediction of Protein Epsilon Lysine Acetylation Sites Based on a Feature Selection Method
    Gao, Jianzhao
    Tao, Xue-Wen
    Zhao, Jia
    Feng, Yuan-Ming
    Cai, Yu-Dong
    Zhang, Ning
    [J]. COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2017, 20 (07) : 629 - 637