Prediction of protein N-formylation and comparison with N-acetylation based on a feature selection method

Cited by: 12
Authors
Zhou, You [1 ,2 ,3 ]
Huang, Tao [2 ,3 ]
Huang, Guohua [1 ]
Zhang, Ning [4 ]
Kong, XiangYin [2 ,3 ]
Cai, Yu-Dong [1 ]
Affiliations
[1] Shanghai Univ, Sch Life Sci, Shanghai, Peoples R China
[2] Chinese Acad Sci, Inst Hlth Sci, Shanghai Inst Biol Sci, Shanghai, Peoples R China
[3] Shanghai Jiao Tong Univ, Sch Med, Shanghai, Peoples R China
[4] Tianjin Univ, Dept Biomed Engn, Tianjin Key Lab Biomed Engn Measurement, Tianjin, Peoples R China
Funding
National Natural Science Foundation of China; National Research Foundation, Singapore
Keywords
N-formylation; N-acetylation; Post-translational modification; Random forest; Incremental feature selection; LINKER HISTONE H1; LYSINE ACETYLATION; POSTTRANSLATIONAL MODIFICATIONS; INTRINSIC DISORDER; SITES; METHYLATION; SEQUENCES; PHOSPHORYLATION; IDENTIFICATION; DATABASE;
DOI
10.1016/j.neucom.2015.10.148
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Post-translational modifications play important roles in cell activities ranging from gene regulation to cytoplasmic mechanisms. Unfortunately, experimental methods for investigating protein post-translational modifications, such as high-resolution mass spectrometry, are time-consuming, labor-intensive and expensive. Therefore, there is a need to develop computational methods to facilitate fast and efficient identification. In this study, we developed a method to predict N-formylated methionines based on the Dagging method. Various features were incorporated, including PSSM conservation scores, amino acid factors, secondary structures, solvent accessibilities and disorder scores. An optimal set of 28 features was selected using the mRMR (Maximum Relevance Minimum Redundancy) method and the IFS (Incremental Feature Selection) method. The prediction model constructed from these features achieved an accuracy of 0.9074 and an MCC value of 0.7478. Analysis of these optimal features revealed several factors and sites that play important roles in N-formylation. We also compared N-formylation with N-acetylation, another important type of N-terminal modification of methionines. The top 34 MaxRel (most relevant) features were selected to discriminate between the two types of modifications; these may be candidates for studying the different mechanisms of N-formylation and N-acetylation. The results of our study further the understanding of these two types of modifications and provide guidance for related validation experiments. (C) 2016 Elsevier B.V. All rights reserved.
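The abstract names two generic techniques, incremental feature selection over a ranked feature list and evaluation by accuracy and MCC (Matthews correlation coefficient). The sketch below illustrates only those general ideas and is not the authors' implementation. The function names, the `evaluate` callback and the toy scores are hypothetical; in practice the ranking would come from an mRMR step and `evaluate` would train and cross-validate a classifier on the chosen features.

```python
import math


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient from confusion-matrix counts."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # By convention, return 0 when any marginal total is zero.
    return 0.0 if denom == 0 else (tp * tn - fp * fn) / denom


def accuracy(tp, tn, fp, fn):
    """Fraction of correctly classified samples."""
    return (tp + tn) / (tp + tn + fp + fn)


def incremental_feature_selection(ranked_features, evaluate):
    """Score the top-1, top-2, ..., top-N ranked features and keep the best prefix.

    ranked_features: feature names in ranked order (assumed to come from mRMR).
    evaluate: hypothetical callback mapping a feature list to a score such as MCC.
    """
    best_score, best_subset = float("-inf"), []
    for k in range(1, len(ranked_features) + 1):
        subset = ranked_features[:k]
        score = evaluate(subset)
        if score > best_score:  # strict '>' keeps the smallest subset on ties
            best_score, best_subset = score, subset
    return best_subset, best_score


# Toy stand-in for a cross-validated classifier: score depends only on subset size.
toy_scores = {1: 0.40, 2: 0.62, 3: 0.75, 4: 0.71}
ranked = ["f1", "f2", "f3", "f4"]
subset, score = incremental_feature_selection(ranked, lambda s: toy_scores[len(s)])
print(subset, score)
```

With these toy scores the loop stops improving after the third feature, which mirrors how an optimal prefix of a ranked list (28 features in the abstract) would be chosen.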
Pages: 53-62
Page count: 10
Related papers
50 in total
  • [1] N-ACETYLATION AND N-FORMYLATION OF CARCINOGENIC ARYLAMINES AND RELATED-COMPOUNDS IN DOGS
    OKUMURA, F
    UEDA, O
    KITAMURA, S
    TATSUMI, K
    CARCINOGENESIS, 1995, 16 (01) : 71 - 76
  • [2] N-acetylation and N-formylation of m-aminobenzoic acid by cell suspension cultures of Solanum laciniatum
    Syahrania, A
    Panjaitan, TS
    Indrayanto, G
    Wilkins, AL
    JOURNAL OF ASIAN NATURAL PRODUCTS RESEARCH, 2000, 2 (04) : 305 - 309
  • [3] TiO2 NPs as Catalyst for N-Formylation and N-Acetylation of Amines Under Solvent-Free Conditions
    Tajbakhsh, Mahmood
    Rahman, Hosseinzadeh
    Heshmatollah, Alinezhad
    Parizad, Rezaee
    Tajbakhsh, Mahgol
    LETTERS IN ORGANIC CHEMISTRY, 2013, 10 (09) : 657 - 663
  • [4] Predicting N-terminal acetylation based on feature selection method
    Cai, Yu-Dong
    Lu, Lin
    BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2008, 372 (04) : 862 - 865
  • [5] New synthetic method for the N-acetylation of cysteine
    Xu, Heng
    Zhang, Qun
    Kong, Xue-jun
    Jingxi Huagong/Fine Chemicals, 2000, 17 (04): : 205 - 207
  • [6] Prediction of protein N-formylation using the composition of k-spaced amino acid pairs
    Ju, Zhe
    Cao, Jun-Zhe
    ANALYTICAL BIOCHEMISTRY, 2017, 534 : 40 - 45
  • [7] Computational Prediction of Protein Epsilon Lysine Acetylation Sites Based on a Feature Selection Method
    Gao, Jianzhao
    Tao, Xue-Wen
    Zhao, Jia
    Feng, Yuan-Ming
    Cai, Yu-Dong
    Zhang, Ning
    COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2017, 20 (07) : 629 - 637
  • [8] RAPID AND EFFICIENT METHOD FOR THE N-FORMYLATION OF C-BLOCKED PEPTIDES
    LAJOIE, G
    KRAUS, JL
    PEPTIDES, 1984, 5 (03) : 653 - 654
  • [9] N-formylation of amines with CO2 by using Zr-based metal-organic frameworks: Contribution of defect sites of MOFs to N-formylation
    Yoo, Dong Kyu
    Jhung, Sung Hwa
    APPLIED CATALYSIS A-GENERAL, 2023, 659
  • [10] A convenient method for the N-formylation of secondary amines and anilines using ammonium formate
    Reddy, PG
    Kumar, GDK
    Baskaran, S
    TETRAHEDRON LETTERS, 2000, 41 (47) : 9149 - 9151