Protein fold families prediction based on graph representations and machine learning methods

Authors
Areiza-Laverde, H. J. [1]
Mercado-Diaz, L. R. [1]
Castro-Ospina, A. E. [1]
Jaramillo-Garzon, J. A. [1]
Affiliations
[1] Inst Tecnol Metropolitano, Medellin, Antioquia, Colombia
Keywords
STRUCTURAL CLASSIFICATION; STRUCTURE ALIGNMENT; DATABASE; SCOP;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Prediction of protein fold families remains an open challenge in molecular biology and bioinformatics, mainly because proteins adopt a broad range of complex three-dimensional configurations and because the number of proteins registered in datasets has increased dramatically in recent years. Computational alternatives must therefore be designed to substitute for experimental methods. However, computational methods have struggled to extract features that capture both the physicochemical attributes and the spatial features of the protein, which limits prediction accuracy. In this paper, we propose using graph theory to represent the positions of a protein's amino acids as graph nodes, with graph edges connecting amino acids whose distance falls below a given threshold. In this way we obtain highly descriptive features related to the spatial and physicochemical properties of proteins, which describe their three-dimensional structure and allow protein fold families to be predicted with good accuracy.
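The graph construction described in the abstract can be sketched roughly as follows. This is only a minimal illustration under stated assumptions, not the authors' implementation: the function names `build_contact_graph` and `degree_features`, the toy coordinates, and the 6.0 threshold are all hypothetical, and the abstract does not say which atom represents each amino acid, which threshold is used, or which graph features are extracted.

```python
import math
from itertools import combinations


def build_contact_graph(coords, threshold):
    """Return an adjacency dict mapping each residue index to the set of
    residue indices closer than `threshold` (Euclidean distance)."""
    graph = {i: set() for i in range(len(coords))}
    for i, j in combinations(range(len(coords)), 2):
        if math.dist(coords[i], coords[j]) < threshold:
            graph[i].add(j)
            graph[j].add(i)
    return graph


def degree_features(graph):
    """Compute a few simple graph-level descriptors (assumed for
    illustration): node count, edge count, and mean degree."""
    n = len(graph)
    edges = sum(len(neighbors) for neighbors in graph.values()) // 2
    mean_degree = 2 * edges / n if n else 0.0
    return {"nodes": n, "edges": edges, "mean_degree": mean_degree}


# Hypothetical toy coordinates, one 3D point per amino acid.
coords = [
    (0.0, 0.0, 0.0),
    (3.8, 0.0, 0.0),
    (7.6, 0.0, 0.0),
    (3.8, 3.8, 0.0),
]

# The threshold value is an assumption chosen only for this toy example.
graph = build_contact_graph(coords, threshold=6.0)
features = degree_features(graph)
print(graph)
print(features)
```

Descriptors like these could then be passed, one feature vector per protein, to a standard classifier; physicochemical properties could be attached to the nodes in a similar way.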
Pages: 6