Pathogenicity Prediction of Single Amino Acid Variants With Machine Learning Model Based on Protein Structural Energies

被引:1
|
作者
Wu, Tzu-Hsuan [1 ]
Lin, Peng-Chan [2 ]
Chou, Hsin-Hung [3 ]
Shen, Meng-Ru [4 ]
Hsieh, Sun-Yuan [1 ,5 ]
机构
[1] Natl Cheng Kung Univ, Inst Med Informat, Tainan 701, Taiwan
[2] Natl Cheng Kung Univ Hosp, Dept Comp Sci & Informat Engn, Dept Internal Med, Tainan 704, Taiwan
[3] Natl Chi Nan Univ, Dept Comp Sci & Informat Engn, Puli Township 54516, Nantou County, Taiwan
[4] Natl Cheng Kung Univ, Dept Obstet & Gynecol, Dept Pharmacol, Coll Med, Tainan 701, Taiwan
[5] Natl Cheng Kung Univ, Inst Mfg Informat Syst, Dept Comp Sci & Informat Engn, Tainan 701, Taiwan
关键词
Machine learning; pathogenicity prediction; protein structure energy; single amino acid variants; SNP; MUTATIONS; POLYMORPHISMS;
D O I
10.1109/TCBB.2021.3139048
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The most popular tools for predicting pathogenicity of single amino acid variants (SAVs) were developed based on sequence-based techniques. SAVs may change protein structure and function. In the context of van derWaals force and disulfide bridge calculations, no method directly predicts the impact of mutations on the energies of the protein structure. Here, we combined machine learning methods and energy scores of protein structures calculated by Rosetta Energy Function 2015 to predict SAV pathogenicity. The accuracy level of our model (0.76) is higher than that of six prediction tools. Further analyses revealed that the differential reference energies, attractive energies, and solvation of polar atoms between wildtype and mutant side-chains played essential roles in distinguishing benign from pathogenic variants. These features indicated the physicochemical properties of amino acids, which were observed in 3D structures instead of sequences. We added 16 features to Rhapsody (the prediction tool we used for our data set) and consequently improved its performance. The results indicated that these energy scores were more appropriate and more detailed representations of the pathogenicity of SAVs.
引用
收藏
页码:606 / 615
页数:10
相关论文
共 50 条
  • [31] Structural deformation prediction model based on extreme learning machine algorithm and particle swarm optimization
    Jiang, Shouyan
    Zhao, Linxin
    Du, Chengbin
    STRUCTURAL HEALTH MONITORING-AN INTERNATIONAL JOURNAL, 2022, 21 (06): : 2786 - 2803
  • [32] Support Vector Machine-based classification of protein folds using the structural properties of amino acid residues and amino acid residue pairs
    Shamim, Mohammad Tabrez Anwar
    Anwaruddin, Mohammad
    Nagarajaram, H. A.
    BIOINFORMATICS, 2007, 23 (24) : 3320 - 3327
  • [33] QoE Prediction Model for IPTV based on Machine Learning
    Meng, Hao
    Huang, Ruochen
    Wei, Xin
    Qian, Yi
    Liu, Qifeng
    2016 8TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS & SIGNAL PROCESSING (WCSP), 2016,
  • [34] Prediction Model of Ischemic Stroke Based on Machine Learning
    Zhang, Zhijie
    Zou, Zhihong
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2023, 32 (05)
  • [35] A Machine Learning Model for Wave Prediction Based on Support Vector Machine
    Liu, Qiang
    Feng, Xingya
    Tang, Tianning
    INTERNATIONAL JOURNAL OF OFFSHORE AND POLAR ENGINEERING, 2022, 32 (04) : 394 - 401
  • [36] Ensemble learning model for Protein-Protein interaction prediction with multiple Machine learning techniques
    Lai, Zhenghui
    Li, Mengshan
    Chen, Qianyong
    Gu, Yunlong
    Wang, Nan
    Guan, Lixin
    MEASUREMENT, 2025, 242
  • [37] Using Chou's Amphiphilic Pseudo-amino Acid Composition and Extreme Learning Machine for Prediction of Protein-protein Interactions
    Huang, Qiao-Ying
    You, Zhu-Hong
    Li, Shuai
    Zhu, Zexuan
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 2952 - 2956
  • [38] Prediction of Protein-Protein Interactions from Amino Acid Sequences using Extreme Learning Machine Combined with Auto Covariance Descriptor
    You, Zhu-Hong
    Li, Liping
    Ji, Zhen
    Li, Min
    Guo, Sen
    2013 IEEE WORKSHOP ON MEMETIC COMPUTING (MC), 2013, : 80 - 85
  • [39] A machine learning model for parameter correlation analysis and structural deformation prediction
    Chen, Cheng
    Wang, Zhansheng
    Shi, Peixin
    Jia, Pengjiao
    2022 INTERNATIONAL CONFERENCE ON MECHANICAL, AUTOMATION AND ELECTRICAL ENGINEERING, CMAEE, 2022, : 13 - 19
  • [40] QAcon: single model quality assessment using protein structural and contact information with machine learning techniques
    Cao, Renzhi
    Adhikari, Badri
    Bhattacharya, Debswapna
    Sun, Miao
    Hou, Jie
    Cheng, Jianlin
    BIOINFORMATICS, 2017, 33 (04) : 586 - 588