Computational identification of 4-carboxyglutamate sites to supplement physiological studies using deep learning

被引:6
|
作者
Naseer, Sheraz [1 ]
Ali, Rao Faizan [1 ,2 ]
Fati, Suliman Mohamed [3 ]
Muneer, Amgad [2 ]
机构
[1] Univ Management & Technol, Dept Comp Sci, Lahore 54770, Pakistan
[2] Univ Teknol Petronas, Comp & Informat Sci Dept, Seri Iskandar 32610, Perak, Malaysia
[3] Prince Sultan Univ, Coll Comp & Informat Sci, Riyadh 11586, Saudi Arabia
关键词
PROTEINS; REPRESENTATIONS; ACCURACY; GLA;
D O I
10.1038/s41598-021-03895-4
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In biological systems, Glutamic acid is a crucial amino acid which is used in protein biosynthesis. Carboxylation of glutamic acid is a significant post-translational modification which plays important role in blood coagulation by activating prothrombin to thrombin. Contrariwise, 4-carboxy-glutamate is also found to be involved in diseases including plaque atherosclerosis, osteoporosis, mineralized heart valves, bone resorption and serves as biomarker for onset of these diseases. Owing to the pathophysiological significance of 4-carboxyglutamate, its identification is important to better understand pathophysiological systems. The wet lab identification of prospective 4-carboxyglutamate sites is costly, laborious and time consuming due to inherent difficulties of in-vivo, ex-vivo and in vitro experiments. To supplement these experiments, we proposed, implemented, and evaluated a different approach to develop 4-carboxyglutamate site predictors using pseudo amino acid compositions (PseAAC) and deep neural networks (DNNs). Our approach does not require any feature extraction and employs deep neural networks to learn feature representation of peptide sequences and performing classification thereof. Proposed approach is validated using standard performance evaluation metrics. Among different deep neural networks, convolutional neural network-based predictor achieved best scores on independent dataset with accuracy of 94.7%, AuC score of 0.91 and F1-score of 0.874 which shows the promise of proposed approach. The iCarboxE-Deep server is deployed at https://share.streamlit.io/sheraz-n/carboxyglutamate/app.py.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Scaffolding protein functional sites using deep learning
    Wang, Jue
    Lisanza, Sidney
    Juergens, David
    Tischer, Doug
    Watson, Joseph L.
    Castro, Karla M.
    Ragotte, Robert
    Saragovi, Amijai
    Milles, Lukas F.
    Baek, Minkyung
    Anishchenko, Ivan
    Yang, Wei
    Hicks, Derrick R.
    Exposit, Marc
    Schlichthaerle, Thomas
    Chun, Jung-Ho
    Dauparas, Justas
    Bennett, Nathaniel
    Wicky, Basile I. M.
    Muenks, Andrew
    DiMaio, Frank
    Correia, Bruno
    Ovchinnikov, Sergey
    Baker, David
    SCIENCE, 2022, 377 (6604) : 387 - +
  • [32] Computational drug discovery using deep learning approaches
    Isayev, Olexandr
    Politi, Regina
    Tropsha, Alexander
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2016, 251
  • [33] Computational Acceleration of Topology Optimization Using Deep Learning
    Rasulzade, Jalal
    Rustamov, Samir
    Akhmetov, Bakytzhan
    Maksum, Yelaman
    Nogaibayeva, Makpal
    APPLIED SCIENCES-BASEL, 2023, 13 (01):
  • [34] Computational screening of umami tastants using deep learning
    Dutta, Prantar
    Gajula, Kishore
    Verma, Nitu
    Jain, Deepak
    Gupta, Rakesh
    Rai, Beena
    MOLECULAR DIVERSITY, 2024,
  • [36] Characterization and Identification of Lysine Succinylation Sites based on Deep Learning Method
    Huang, Kai-Yao
    Hsu, Justin Bo-Kai
    Lee, Tzong-Yi
    SCIENTIFIC REPORTS, 2019, 9 (1)
  • [37] Characterization and Identification of Lysine Succinylation Sites based on Deep Learning Method
    Kai-Yao Huang
    Justin Bo-Kai Hsu
    Tzong-Yi Lee
    Scientific Reports, 9
  • [38] Dog Breed Identification Using Deep Learning
    Raduly, Zalan
    Sulyok, Csaba
    Vadaszi, Zsolt
    Zolde, Attila
    2018 IEEE 16TH INTERNATIONAL SYMPOSIUM ON INTELLIGENT SYSTEMS AND INFORMATICS (SISY 2018), 2018, : 271 - 275
  • [39] Identification of Rodent Species Using Deep Learning
    Seijas, Cesar
    Montilla, Guillermo
    Frassato, Luigi
    COMPUTACION Y SISTEMAS, 2019, 23 (01): : 257 - 266
  • [40] Spoken Language Identification Using Deep Learning
    Singh, Gundeep
    Sharma, Sahil
    Kumar, Vijay
    Kaur, Manjit
    Baz, Mohammed
    Masud, Mehedi
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021