Prediction of protein N-formylation and comparison with N-acetylation based on a feature selection method

Cited by: 12
Authors
Zhou, You [1 ,2 ,3 ]
Huang, Tao [2 ,3 ]
Huang, Guohua [1 ]
Zhang, Ning [4 ]
Kong, XiangYin [2 ,3 ]
Cai, Yu-Dong [1 ]
Affiliations
[1] Shanghai Univ, Sch Life Sci, Shanghai, Peoples R China
[2] Chinese Acad Sci, Inst Hlth Sci, Shanghai Inst Biol Sci, Shanghai, Peoples R China
[3] Shanghai Jiao Tong Univ, Sch Med, Shanghai, Peoples R China
[4] Tianjin Univ, Dept Biomed Engn, Tianjin Key Lab Biomed Engn Measurement, Tianjin, Peoples R China
Funding
National Natural Science Foundation of China; National Research Foundation, Singapore
Keywords
N-formylation; N-acetylation; Post-translational modification; Random forest; Incremental feature selection; LINKER HISTONE H1; LYSINE ACETYLATION; POSTTRANSLATIONAL MODIFICATIONS; INTRINSIC DISORDER; SITES; METHYLATION; SEQUENCES; PHOSPHORYLATION; IDENTIFICATION; DATABASE;
DOI
10.1016/j.neucom.2015.10.148
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Post-translational modifications play important roles in cell activities ranging from gene regulation to cytoplasmic mechanisms. Unfortunately, experimental methods for investigating protein post-translational modifications, such as high-resolution mass spectrometry, are time-consuming, labor-intensive and expensive. Therefore, there is a need to develop computational methods to facilitate fast and efficient identification. In this study, we developed a method to predict N-formylated methionines based on the Dagging method. Various features were incorporated, including PSSM conservation scores, amino acid factors, secondary structures, solvent accessibilities and disorder scores. An optimal set of 28 features was selected using the mRMR (Maximum Relevance Minimum Redundancy) method and the IFS (Incremental Feature Selection) method. The prediction model constructed from these features achieved an accuracy of 0.9074 and an MCC value of 0.7478. Analysis of these optimal features revealed several factors and sites that play important roles in N-formylation. We also compared N-formylation with N-acetylation, another important type of N-terminal modification of methionines. The top 34 MaxRel (most relevant) features were selected to discriminate between the two types of modification; these may be candidates for studying the different mechanisms of N-formylation and N-acetylation. The results of our study further the understanding of these two types of modification and provide guidance for related validation experiments. (C) 2016 Elsevier B.V. All rights reserved.
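The abstract names two pieces that are easy to illustrate: the MCC metric and the IFS loop over a ranked feature list. The following is a minimal sketch, not the authors' code. It assumes the features have already been ranked (for example by mRMR), and `evaluate` is a hypothetical callback standing in for whatever cross-validated classifier score is used; the function names are illustrative only.

```python
import math


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient from confusion-matrix counts."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        # Convention: return 0 when any marginal total is zero.
        return 0.0
    return (tp * tn - fp * fn) / denom


def accuracy(tp, tn, fp, fn):
    """Fraction of all samples that were classified correctly."""
    return (tp + tn) / (tp + tn + fp + fn)


def incremental_feature_selection(ranked_features, evaluate):
    """Score every prefix of a ranked feature list and keep the best one.

    ranked_features: feature names, most important first (assumed given).
    evaluate: hypothetical callback mapping a feature subset to a score,
              e.g. a cross-validated MCC of a classifier trained on it.
    Ties are resolved in favour of the smaller subset.
    """
    best_subset, best_score = [], float("-inf")
    for k in range(1, len(ranked_features) + 1):
        subset = ranked_features[:k]
        score = evaluate(subset)
        if score > best_score:
            best_subset, best_score = subset, score
    return best_subset, best_score
```

In practice `evaluate` would train and cross-validate a model on the given subset; any scoring function with the same shape can be passed in to try the loop on toy data.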
Pages: 53-62
Number of pages: 10