Sequence-based Protein-Protein Interaction Prediction using Greedy Layer-Wise Training of Deep Neural Networks

被引:5
|
作者
Hanggara, Faruq Sandi [1 ]
Anam, Khairul [1 ]
机构
[1] Univ Jember, Jember Regency, Indonesia
关键词
D O I
10.1063/5.0014721
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Jamu is an herbal medicine commonly used before the advent of modem medicine. Generally, the herbal formula is obtained empirically and passed down from generation to generation. However, the healing process with herbs is also influenced by such as myths and local customs. This influence causes differences in the use of herbal ingredients to cure the same disease. The result is a collection of herbal recipes that overlap each other without any supporting evidence of its validity. Protein-protein interaction (PPI) is a biological process that is influenced by drugs in the healing process. Therefore, PPI due to the consumption of herbs can be used as evidence of the effectiveness of herbal medicine. PPI analysis needs to be done to study how proteins interact with other proteins. PPI analysis with an experimental method (wet lab) cannot be carried out on extensive data and only covers a portion of protein interaction networks. Therefore, a computational approach needs to be done. In previous studies, predictions of PPIs were proven to be carried out using only protein sequence information. The advantage of using this protein sequence information is that this method is more universal. Information that can be obtained from protein sequences includes Discrete Cosine Transform, Multi-scale Local Descriptor, Autocovariance, and Conjoint Triad. The study with the sequence information has been done using different machine learning approaches, such as Support Vector Machines, Random Forest, and Probabilistic Neural Networks. A deep learning approach has also been done with Stacked-Autoencoder, which tried to construct a hidden structure of protein sequences. Previously, deep learning has also been proven to be able to handle raw and complex data on a large scale and learn the useful and abstract features of perceptual problems such as image recognition and voice. The method proposed in this study is deep neural networks that were trained using stacked-autoencoder and stacked-randomized autoencoder. The extracted features used are conjoint-triad. This study compares both methods which have different characteristics in the construction of layers in deep neural networks. We conducted experiments with k-Fold cross-validation which became the gold standard for most predictive model testing. Our experiments with 5 cross-validations and 3 hidden layers gave an average validation accuracy of 0.89 +/- 0.02 for the SAE method and 0.51 +/- 0.003 for the ML-ELM.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Cracking the black box of deep sequence-based protein-protein interaction prediction
    Bernett, Judith
    Blumenthal, David B.
    List, Markus
    [J]. BRIEFINGS IN BIOINFORMATICS, 2024, 25 (02)
  • [2] MULTIMODAL PRE-TRAINING MODEL FOR SEQUENCE-BASED PREDICTION OF PROTEIN-PROTEIN INTERACTION
    Xue, Yang
    Liu, Zijing
    Fang, Xiaomin
    Wang, Fan
    [J]. MACHINE LEARNING IN COMPUTATIONAL BIOLOGY, VOL 165, 2021, 165 : 34 - 46
  • [3] Evolution of Sequence-based Bioinformatics Tools for Protein-protein Interaction Prediction
    Khatun, Mst Shamima
    Shoombuatong, Watshara
    Hasan, Md Mehedi
    Kurata, Hiroyuki
    [J]. CURRENT GENOMICS, 2020, 21 (06) : 454 - 463
  • [4] Supervised Greedy Layer-Wise Training for Deep Convolutional Networks with Small Datasets
    Rueda-Plata, Diego
    Ramos-Pollan, Raul
    Gonzalez, Fabio A.
    [J]. COMPUTATIONAL COLLECTIVE INTELLIGENCE (ICCCI 2015), PT I, 2015, 9329 : 275 - 284
  • [5] Voice Conversion Using Deep Neural Networks With Layer-Wise Generative Training
    Chen, Ling-Hui
    Ling, Zhen-Hua
    Liu, Li-Juan
    Dai, Li-Rong
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (12) : 1859 - 1872
  • [6] Sequence-based prediction of protein protein interaction using a deep-learning algorithm
    Sun, Tanlin
    Zhou, Bo
    Lai, Luhua
    Pei, Jianfeng
    [J]. BMC BIOINFORMATICS, 2017, 18
  • [7] Sequence-based prediction of protein protein interaction using a deep-learning algorithm
    Tanlin Sun
    Bo Zhou
    Luhua Lai
    Jianfeng Pei
    [J]. BMC Bioinformatics, 18
  • [8] Sequence-based protein-protein interaction prediction via support vector machine
    Yongcui Wang
    Jiguang Wang
    Zhixia Yang
    Naiyang Deng
    [J]. Journal of Systems Science and Complexity, 2010, 23 : 1012 - 1023
  • [9] DeNovo: virus-host sequence-based protein-protein interaction prediction
    Eid, Fatma-Elzahraa
    ElHefnawi, Mahmoud
    Heath, Lenwood S.
    [J]. BIOINFORMATICS, 2016, 32 (08) : 1144 - 1150
  • [10] Sequence-based protein-protein interaction prediction via support vector machine
    Wang, Yongcui
    Wang, Jiguang
    Yang, Zhixia
    Deng, Naiyang
    [J]. JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2010, 23 (05) : 1012 - 1023