Sequence- and structure-based prediction of amyloidogenic regions in proteins

被引:0
|
作者
Hafida Bouziane
Abdallah Chouarfia
机构
[1] Université des Sciences et de la Technologie d’Oran Mohamed Boudiaf,Département d’ Informatique
[2] USTO-MB,undefined
来源
Soft Computing | 2020年 / 24卷
关键词
Protein misfolding; Amyloid aggregation; Secondary structure; Solvent accessibility; Support vector machine; String kernels;
D O I
暂无
中图分类号
学科分类号
摘要
Machine learning methods are increasingly used in proteomics research, especially in analyzing and predicting protein structures, functions, subcellular localizations and interactions. However, much research in recent years has focused on protein misfolding problem and the impact of unfolded and defective proteins on cell dysfunction, due to its considerable importance for molecular medicine. These abnormal proteins degradation and deposition often result in the formation of certain plaque cores among them the so-called amyloid fibrils which are responsible for an increasing number of highly debilitating disorders in humans. Yet, a significant challenge remains, especially in understanding the underlying causes and major risk factors of these harmful deposits in vital organs and tissues. This paper explores the potential of string kernel-based support vector machines in the prediction of amyloidogenic regions in proteins by incorporating the most informative features of the protein sequence such as predicted secondary structure and solvent accessibility, with a special focus on α\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha $$\end{document}-helical conformations which seem to be primarily concerned with amyloidogenesis. The performances compared with the most popular methods on Pep424 and Reg33 benchmark datasets indicate the robustness of the predictive model. Furthermore, the results showed accurate prediction of regions promoting fibrillogenesis for experimentally determined amyloid proteins and revealed that the five amino acids Leucine, Glycine, Alanine, Valine and Serine are predominantly present in amyloid-prone regions and confirm that the core regions of an amyloid aggregate are not necessarily fully buried.
引用
收藏
页码:3285 / 3308
页数:23
相关论文
共 50 条
  • [11] Sequence homology of parathyroid hormone against amyloidogenic regions of proteins
    Benvenga, Salvatore
    Guarneri, Fabrizio
    Vita, Roberto
    ENDOCRINE, 2017, 55 (02) : 635 - 639
  • [12] Sequence Complexity of Amyloidogenic Regions in Intrinsically Disordered Human Proteins
    Das, Swagata
    Pal, Uttam
    Das, Supriya
    Bagga, Khyati
    Roy, Anupam
    Mrigwani, Arpita
    Maiti, Nakul C.
    PLOS ONE, 2014, 9 (03):
  • [13] Local structure prediction with local structure-based sequence profiles
    Yang, AS
    Wang, LY
    BIOINFORMATICS, 2003, 19 (10) : 1267 - 1274
  • [14] Sequence homology of parathyroid hormone against amyloidogenic regions of proteins
    Salvatore Benvenga
    Fabrizio Guarneri
    Roberto Vita
    Endocrine, 2017, 55 : 635 - 639
  • [15] FoldAmyloid: a method of prediction of amyloidogenic regions from protein sequence
    Garbuzynskiy, Sergiy O.
    Lobanov, Michail Yu.
    Galzitskaya, Oxana V.
    BIOINFORMATICS, 2010, 26 (03) : 326 - 332
  • [16] Structure-based prediction of methyl chemical shifts in proteins
    Aleksandr B. Sahakyan
    Wim F. Vranken
    Andrea Cavalli
    Michele Vendruscolo
    Journal of Biomolecular NMR, 2011, 50 : 331 - 346
  • [17] Structure-based prediction of methyl chemical shifts in proteins
    Sahakyan, Aleksandr B.
    Vranken, Wim F.
    Cavalli, Andrea
    Vendruscolo, Michele
    JOURNAL OF BIOMOLECULAR NMR, 2011, 50 (04) : 331 - 346
  • [18] Structure-based approach to the prediction of disulfide bonds in proteins
    Salam, Noeris K.
    Adzhigirey, Matvey
    Sherman, Woody
    Pearlman, David A.
    PROTEIN ENGINEERING DESIGN & SELECTION, 2014, 27 (10): : 365 - 374
  • [19] Sequence- and Structure-Based Functional Annotation and Assessment of Metabolic Transporters in Aspergillus oryzae: A Representative Case Study
    Raethong, Nachon
    Wong-ekkabut, Jirasak
    Laoteng, Kobkul
    Vongsangnak, Wanwipa
    BIOMED RESEARCH INTERNATIONAL, 2016, 2016
  • [20] Two sequence- and two structure-based ML models have learned different aspects of protein biochemistry
    Kulikova, Anastasiya V.
    Diaz, Daniel J.
    Chen, Tianlong
    Cole, T. Jeffrey
    Ellington, Andrew D.
    Wilke, Claus O.
    SCIENTIFIC REPORTS, 2023, 13 (01)