Sequence- and structure-based prediction of amyloidogenic regions in proteins

被引:0
|
作者
Hafida Bouziane
Abdallah Chouarfia
机构
[1] Université des Sciences et de la Technologie d’Oran Mohamed Boudiaf,Département d’ Informatique
[2] USTO-MB,undefined
来源
Soft Computing | 2020年 / 24卷
关键词
Protein misfolding; Amyloid aggregation; Secondary structure; Solvent accessibility; Support vector machine; String kernels;
D O I
暂无
中图分类号
学科分类号
摘要
Machine learning methods are increasingly used in proteomics research, especially in analyzing and predicting protein structures, functions, subcellular localizations and interactions. However, much research in recent years has focused on protein misfolding problem and the impact of unfolded and defective proteins on cell dysfunction, due to its considerable importance for molecular medicine. These abnormal proteins degradation and deposition often result in the formation of certain plaque cores among them the so-called amyloid fibrils which are responsible for an increasing number of highly debilitating disorders in humans. Yet, a significant challenge remains, especially in understanding the underlying causes and major risk factors of these harmful deposits in vital organs and tissues. This paper explores the potential of string kernel-based support vector machines in the prediction of amyloidogenic regions in proteins by incorporating the most informative features of the protein sequence such as predicted secondary structure and solvent accessibility, with a special focus on α\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha $$\end{document}-helical conformations which seem to be primarily concerned with amyloidogenesis. The performances compared with the most popular methods on Pep424 and Reg33 benchmark datasets indicate the robustness of the predictive model. Furthermore, the results showed accurate prediction of regions promoting fibrillogenesis for experimentally determined amyloid proteins and revealed that the five amino acids Leucine, Glycine, Alanine, Valine and Serine are predominantly present in amyloid-prone regions and confirm that the core regions of an amyloid aggregate are not necessarily fully buried.
引用
收藏
页码:3285 / 3308
页数:23
相关论文
共 50 条
  • [1] Sequence- and structure-based prediction of amyloidogenic regions in proteins
    Bouziane, Hafida
    Chouarfia, Abdallah
    SOFT COMPUTING, 2020, 24 (05) : 3285 - 3308
  • [2] Sequence- and structure-based analysis of proteins involved in miRNA biogenesis
    Sharma, Chhaya
    Mohanty, Debasisa
    JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2018, 36 (01): : 139 - 151
  • [3] BiPPred: Combined sequence- and structure-based prediction of peptide binding to the Hsp70 chaperone BiP
    Schneider, Markus
    Rosam, Mathias
    Glaser, Manuel
    Patronov, Atanas
    Shah, Harpreet
    Back, Katrin Christiane
    Daake, Marina Angelika
    Buchner, Johannes
    Antes, Iris
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2016, 84 (10) : 1390 - 1407
  • [4] Sequence- and Structure-Based Analysis of Tissue-Specific Phosphorylation Sites
    Karabulut, Nermin Pinar
    Frishman, Dmitrij
    PLOS ONE, 2016, 11 (06):
  • [5] Sequence- and Structure-Based Immunoreactive Epitope Discovery for Burkholderia pseudomallei Flagellin
    Nithichanon, Arnone
    Rinchai, Darawan
    Gori, Alessandro
    Lassaux, Patricia
    Peri, Claudio
    Conchillio-Sole, Oscar
    Ferrer-Navarro, Mario
    Gourlay, Louise J.
    Nardini, Marco
    Vila, Jordi
    Daura, Xavier
    Colombo, Giorgio
    Bolognesi, Martino
    Lertmemonkolchai, Ganjana
    PLOS NEGLECTED TROPICAL DISEASES, 2015, 9 (07): : 1 - 20
  • [6] Analysis of protein missense alterations by combining sequence- and structure-based methods
    Gyulkhandanyan, Aram
    Rezaie, Alireza R.
    Roumenina, Lubka
    Lagarde, Nathalie
    Fremeaux-Bacchi, Veronique
    Miteva, Maria A.
    Villoutreix, Bruno O.
    MOLECULAR GENETICS & GENOMIC MEDICINE, 2020, 8 (04):
  • [7] Increasing the thermostability of sucrose phosphorylase by a combination of sequence- and structure-based mutagenesis
    Cerdobbel, An
    De Winter, Karel
    Aerts, Dirk
    Kuipers, Remko
    Joosten, Henk-Jan
    Soetaert, Wim
    Desmet, Tom
    PROTEIN ENGINEERING DESIGN & SELECTION, 2011, 24 (11): : 829 - 834
  • [8] Genomic-scale comparison of sequence- and structure-based methods of function prediction: Does structure provide additional insight?
    Fetrow, JS
    Siew, N
    Di Gennaro, JA
    Martinez-Yamout, M
    Dyson, HJ
    Skolnick, J
    PROTEIN SCIENCE, 2001, 10 (05) : 1005 - 1014
  • [9] On the diversity of F420-dependent oxidoreductases: A sequence- and structure-based classification
    Mascotti, Maria Laura
    Juri Ayub, Maximiliano
    Fraaije, Marco W.
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2021, 89 (11) : 1497 - 1507
  • [10] A comparative study of sequence- and structure-based features of small RNAs and other RNAs of bacteria
    Barik, Amita
    Das, Santasabuj
    RNA BIOLOGY, 2018, 15 (01) : 95 - 103